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Sequence 



<213> OrganismName : Escherichia 
5 <400> PreSequenceString : 

atgaaaagac ttcacaagag gttcctctta 
ctccaggccg ccgatgtcac tatcactgtt 
attcaaacca aagaagctaa cgttaatctc 
cctggttctg catctggctg gcacaatatt 

10 acaagtgcag tgacggcaat cgtgacaggt 
gaaggtactg ccgaaaatat tcagatagag 
aatggcgata gcaaaacggt tattgttgat 
aaggcaagag ctatcacggt gaatggaaac 
aatgtgatct acacctggca ataa 

15 <212> Type : DNA 

<211> Length : 504 

SequenceMame : SEQ ID 431 
SequenceDe script ion : 



COli 0157 :H7 

gctacgtttt gcgcgttatt 
aatggtcggg tagtcgctaa 
ggggatcttt atacgcgcaa 
actttgtcat taaccgattg 
tcaactgaca atacgggtta 
cttagggatg accaggatgc 
gagatcactc gtaatgcaca 
gcaagccagg gaacgatcga 



aacagcaact 
accctgcact 
tctgcaacaa 
tccggctgaa 
ttacaaaaat 
gacgttaaaa 
gtttccactt 
ggcgctaatc 



60 
120 
180 
240 
300 
360 
420 
480 
504 



20 



Sequence 



25 



30 



35 



40 



45 



<213> OrganismName : Escherichia 
<400> PreSequenceString : 
atgcgggcga aacttctggg aatagtcctg 
tctaccgaga ctttatcgtt tactcctgac 
ctgagcggaa aaacaaaaga gcgtgtttat 
caactcgact ggaaattcaa taacgctgca 
atgccccaga tatctatcgg ggctgctggc 
atggtcgatc aggactggat ggattccagt 
caccctgata cacaactcaa ttatgccaac 
ctcaacgaac ccaattaccg cctgggactc 
tttacagcca gaggtggttc ctatatctac 
ggctccttcc cgaatggaga aagagcaatc 
attggcttga ctggaagtta tcgttatgaa 
agcggctggg tggaagcatc tgataacgat 
tatcgcagta aagtcaaaga ccaaaattac 
gtaacaccta acgcaaaagt ttatgttgaa 
ggtaatactt cactttatga tcacaatgat 
ggcatagaaa actataactt catcactact 

<212> Type : DNA 

<211> Length : 954 

SequenceMame : SEQ ID 432 
SequenceDe script ion : 

Sequence 



COli 0157:H7 



acaaccccta 
aacataaatg 
ctagccgaag 
attattaaag 
tggacaactc 
aaccccggaa 
gaatttgatc 
atggccggat 
agttctgagg 
ggctacaaac 
gattttgagc 
gagcactatg 
tattctgttt 
ggcacatgga 
aacacttcag 
gctggtctta 



ttgcgatcag 
cggacattag 
aaggaggccg 
gtgcaattaa 
tcggcagccg 
cctggacgga 
tgaatatcaa 
atcaggaaag 
agggattcag 
aacgttttaa 
taggtggcac 
acccaggaaa 
cagtcaatgc 
atcgggttac 
actacagcaa 
agtacacctt 



ctcttttgct 
tcttggaact 
aaaagtcagt 
ttgggatttg 
aggtggcaat 
tgaaagtaga 
aggctggctc 
ccgttatagc 
agatgatatc 
aatgccctac 
atttaaatac 
aagaatcact 
aggttattac 
gaataaaaaa 
aaatggtgca 
ttaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
954 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

50 ttgtttttta agcgaggaaa gattttgagt gcgggacgcc tgaataaaaa atctctgggt 60 

atcgtgatgt tgttatcggt tggactgctt ttggcgggct gttcgggtag caaatcatcc 120 

gatacaggaa cgtattccgg ctccgtttac accgtgaaac ggggggatac gctatatcgt 180 

atttcgcgca ccacgggaac cagcgtaaaa gagctggcgc gactgaacgg catttccccc 24 0 

ccttacacca ttgaagttgg tcagaaacta aaactgggtg gggcgaaaag tagcagtagt 3 00 

55 acacgtaaat caaccgccaa atcaacgacc aaaaccgcat cggttacacc gtcatcagcg 3 60 

gtaccgaaat cttcctggcc gccagtaggg caacgttgtt ggttatggcc aacgacaggg 420 

aaagttatca tgccgtattc gacagcagat ggcggcaata aagggattga tatctcagct 48 0 

ccacggggta cacctattta cgccgcgggt gcaggaaagg tggtgtatgt gggcaaccag 540 

ctgcgtggct acggtaatct catcatgatt aaacacagtg aagattacat tacggcttac 600 

60 gcccataatg acacgatgct ggtaaataat gggcaaagcg tgaaggctgg gcaaaaaatc 660 

gccaccatgg ggagcacgga tgcggcatct gttcgcctgc atttccagat tcgttaccgt 720 

gcaacggcaa ttgatccgct acgttacttg ccgcctcagg gcagcaagcc aaaatgctga 78 0 



<212> Type : DNA 
65 <211> Length : 780 

SequenceName : SEQ ID 433 
SequenceDescription : 
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25 



40 



45 



Sequence 



<212> Type : DNA 

<211> Length : 717 

SequenceName : SEQ ID 435 
SequenceDescription : 

Sequence 



120 
180 
240 



Sequence 

<213> OrganismName : Escherichia coli 0157:H7 
5 <40 0> PreSequenceString : 

atgccaacac caaatcctct ggcaccggtg aaaggggccg ggaccacact gtgggtttat 60 
aacgggaacg gcgacccata tgcaaacccg ctttcagaca atgactggtc gcgtctggca 
aaggttaaag acctgacgcc cggcgaactg accgctgagt cctatgacga cagttatctc 
gatgatgaag atgcggactg ggccgcgacc ggacaggggc agaaatccgc tggagatacc 

10 agcttcacgc tggcgtggat gcccggagag caggggcagc aggcgctgct ggcgtggttt 300 
aatgaaggtg atacccgtgc ctataaaatc cgcttcccga acggcacggt cgatgtgttc 360 
cgcggctggg tcagcagtat cggtaaggcg gtgacggcga aggaagtgat cacccgcacg 42 0 

gtgaaagtca ccaacgtggg acgtccgtcg atggcagaag atcgcagcac ggtaacagcg 480 
gcaaccggca tgaccgtgac gcctgccagc acctcggtgg tgaaagggca gagcacgacg 540 

15 ctgaccgtgg cattccagcc ggaaggcgca accgacaaga gcttccgtgc ggtgtctgcg 600 
gataaaacaa aagccaccgt gtcggtcagt ggtatgacca tcaccgtgaa aggtgttgct 660 
gcaggcaagg tcaacattcc ggtcgtatcc ggtaatggtg agtttgctgc ggttgcagaa 720 
atcaacgtca ccgccagtta a 741 
<212> Type : DNA 

20 <211> Length : 741 

SequenceName : SEQ ID 434 
SequenceDescription : 



<213> OrganismName : Escherichia coli 0157:H7 
<4 00> PreSequenceString : 

atgtctgctt tgtatgaacg ctcacagctg acgcaggtga tgatttcatc tgccccggcg 60 

actgctgaaa ctatggataa ggcggaatat ctgcgcctgg actgcaccat caaggaagtg 12 0 

30 cagttcaccg ccgggcagaa acaggatatt gatgtgacca cgctctgctc cacagagcag 18 0 

gagaacatca atggtctggg ggcgtcgtct gagatttcca tgtcgggcaa tttttatctg 240 

aatcaggccc agaacgccct gcgtgatgcc tatgacaatg acgcgttgta tgcgtttaag 300 

gtgctgtttc cgtccggtaa gggctttaaa ttcctggcgg aagtgcgcca gcacacctgg 3 60 

tcatccggta ccaacggcgt ggtggctgca acgttctcac tgcgtctgaa aggcaaaccg 420 

35 gtgtcctttg tggtaccgct ggcgtttgtg aaaaatctgg ataagacact taccgtgaat 48 0 

accggtgcgc tgctgacaat gtcagtcagt gccaacgggg gaacgccgcc gtataaatac 540 

gcctggaaga aggatggtca gccggttgac gggcagacga cagacacctt cagtaagcca 60 0 

ggtgcgcagt ccgctgatgc ggggaaatat acctgtgtgg tgaccgattc ggcagagaaa 660 

gcacagagtg tgacgtctgt tgaatgcacc gtgacagtga gcgcagccgc cggataa 717 



<213> OrganismName : Escherichia coli 0157:H7 
<40 0> PreSequenceString : 

50 atgaaaaaga gcactctggc attagtggtg atgggcattg tggcatctgc atccgtacag 60 

gccgcagaaa tatataacaa agacggtaat aaactggatg tctatggcaa agttaaagcc 120 

atgcattata tgagtgataa cgacagtaaa gatggcgacc agagttatat ccgttttggt 180 

tttaaaggcg aaacacaaat taacgatcaa ctgactggtt atggtcgttg ggaagcggag 240 

tttgccggaa ataaagcgga gagtgatact gcacagcaaa aaacgcgtct cgcttttgcc 3 00 

55 ggattgaagt ataaagattt gggttctttc gactatggcc gtaacctggg cgcgttgtat 3 60 

gacgtggaag cctggaccga tatgttcccg gaatttggtg gcgactcctc ggcgcagacc 420 

gacaacttta tgaccaaacg cgccagcggt ctggcgacgt atcggaacac cgacttcttc 48 0 

ggcgttatcg atggcctgaa cttaaccctg caatatcaag ggaaaaacga aaaccgcgac 540 

gttaaaaagc aaaacggcga tggcttcggc acgtcattga catatgactt tggcggcagc 60 0 

60 gatttcgcca ttagtggggc ctataccaac tcagatcgca ccaacgagca gaacctgcaa 66 0 

agccgtggca caggcaagcg tgcagaagct tgggctacag gtctgaaata cgatgccaat 72 0 

aatatttatc tggcaacttt ttattctgaa acacgcaaaa tgacgccaat aactggcggc 78 0 
tttgccaata agacacagaa ctttgaagcg gtcgctcaat accagtttga ctttggtctg 
cgtccatcgc tgggttatgt cttatcgaaa gggaaagata ttgaaggtat cggtgatgaa 

65 gatctggtca attatatcga tgtcggggct acatattatt tcaacaaaaa tatgtcagcg 
tttgttgatt ataaaatcaa ccaactggat agcgataaca aattgaatat taataatgat 

gatattgtcg cggttggcat gacctatcag ttttaa 1056 



840 
900 
960 
1020 
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<212> Type : DNA 

<211> Length : 1056 

SequenceName : SEQ ID 436 
SequenceDe script ion : 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Sequence 

<213> OrganismName : Escherichia 
<4 00> PreSequenceString : 
atgcgtgtca aacatgcagt agttctactc 
accatgactt tccagttccg taatccaaac 
ttattaaata gcgctcaggc ccaaaactct 
ggtattgaaa caccctcagc gttagataac 
99tgggctac tgtcgaatat taataccggt 
attgttgata ttgctaaccg cgatggtcaa 
ggacaaacct cgaccatcca ggtttcgggt 

<212> Type : DMA 

<211> Length : 417 

SequenceName : SEQ ID 43 7 
SequenceDescription : 

Sequence 

<213> OrganismName : Escherichia 
<4 00> PreSequenceString : 
atgaaaagaa aagttctggc aatgctggtc 
gcggctgaaa tttataataa agatggcaat 
ctgcactact tctctgatga tgctagcagc 
ttcaaaggtg aaactcagat cgctgaccaa 
attggcgcaa acggtcctga aagcgacaag 
ggtttcggct ttggtcagaa tggtactttc 
gacgtagaag catggaccga tatgctgcca 
gacaacttca tgaacggtcg tgctaacagc 
ggtcaagttg atggtctgaa ctttgcactc 
ttatttgatc aagaaggttc aggtaacggt 
gacggttcag tatgtccact tcctatgact 
<212> Type : DNA 
<211> Length : 645 

SequenceName : SEQ ID 438 
SequenceDescription : 

Sequence 



coli 0157:H7 

atgcttattt cgccattaag 
tttggtggta acccaaataa 
tataaagatc cgagctataa 
tttactcagg ccatccagtc 
aaaccgggcc gcatggtgac 
ttgcagttga acgtgacaga 
ttacaaaata actcaaccga 



ttgggctgga 

tggcgctttt 

cgatgacttt 
acaaatttta 
•caacgattat 
tcgtaaaacc 
tttttaa 



coli 0157:H7 



ccggcgttat 
aaactggatt 
gatggcgaca 
ttcactggtt 
ggtaataccg 
gactatggtc 
gaatttggtg 
gtagcaacct 
cagtatcagg 
aatggacgta 
ttgactttgg 



tagttgctgg 
tgtacggaaa 
tgtcatatgc 
atggtcagtg 
caacgcgtct 
gtaactacgg 
gagataccta 
atcgtaacaa 
gtaacaacga 
aacttgctaa 
tttaa 



cgcagcaaat 
agtagcgggc 
ccgtatcggt 
ggaatttaac 
ggcatttgca 
tgtcgtatat 
tgctggcgct 
tggtttcttt 
gaaaagcgga 
agagaacggc 



<213> OrganismName : Escherichia 
<400> PreSequenceString : 
ttgaacacgg tgactctgga aggaggaacc 
gtaaaaattg agaaaaacag caacgcggta 
caacttcacg atggtacggt gaataacagc 
ggcgatgcgg tattcaataa ccttgcaggc 
aactctgcgg tagtgaataa cgcgggtacc 
aatgccggga cgctggatat tgacgataag 
cttgataaca gcaaaaacgc cattcgcttc 
aacaccggtg aaatgacgct ggatgccgca 
ggcgccagcg agtttattaa caagggcgtg 
actgccggtg cgacagaaag cgatgccttc 
ttcgataaag acaacgccag cgcggttaaa 
gatggtgtaa tgaacatcag cggcaacaac 
cagctggtta acaacggcgt tatcaatctc 
actggtatgc aactggatgc caatgccacc 
atcaatattt ttgctaacga ctcgtttgcg 
gttaacaacg gtacggtggt gattgccgac 
ggcgacagcg tgaatgtgga aggggtgaac 
tacaccgact acacgttgcc ggacatgcca 
acgactgata gcggtagtag cgatggcagc 
accaacgttg acggcagcgc cggtaaactg 
gggatcaaca ccggtttcgc cgccggtacg 



coli 0157:H7 



ttcaataaca 
attaataaca 
ggtattgcgt 
ggcgaggcgc 
tggaaaatgg 
tcaacgttca 
cagggcagca 
ttaggtgcgg 
gtggatgcga 
ttgtggaacc 
ttcacccaca 
gccgtggcga 
ggtaccgaag 
gccgatgcgg 
ttcagcgtac 
ggcgtgactg 
ggcaacagcg 
aacacttaca 
agtaacaacc 
aaggtcaaca 
gcagacacca 



acggaacgct 
ccggttccct 
cggcgcgcgt 
gtaaaggcgc 
gttatcagga 
acaacagcgg 
atgctaacgc 
gcgctattct 
aagtcaccgt 
aggatggcgg 
acaactatgt 
tggaaggcga 
gcaccaccga 
taattgagaa 
tgggcacaga 
gttcggggtt 
gtaacaatac 
ccacttcacc 
tcaacggcta 
acgccagcat 
cggtcagttt 



taatgacgtc 
gtcaacttta 
taacgctcag 
gattctctat 
tgaaaacaac 
caaactcatc 
cacgttatat 
ctacgacgat 
ggcggtaagt 
ggtaattaac 
tgctctcaac 
taaaaacgca 
taccggattg 
caacggcacg 
aggtcatatc 
gattaagcag 
cgaagtgcat 
tttcagtgaa 
tatcgtcggt 
gaatggtgtc 
cgacaacgtg 



60 
120 
180 
240 
300 
360 
417 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
645 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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10 



15 



20 



25 



gtggaaggca tcaacctgac cgacgccgat 
gccaaaggca gcaccgatgc cagcggcaac 
accgatgtgg cgaccgatgc ttcggtgaac 
accaataacg agctgtatac cagcctgaac 
ctgaagcagg tgagcggtag ccaggcgacc 
aaccgcttca gcatgctggc ggatgccgcg 
gtggtggcga aaggtgaccc gcgtgcggaa 
gcactgcgta aaaccgttga cctgagcgaa 
gcgcgtctgg atggcgacgg tgcgcagaaa 
agccagttct ttggcctgaa gcaccagatg 
gcgctgcgtt atgacgtgca taatctcgac 
agcaaaacgg cggatacgga tgtgaaacag 
aaaacctttg agccgcgcga agggctgaaa 
cactcgctgg aaggcggcta tcaggagcgc 
agcggcagcg aaacggcggt ggacagcatc 
ggcggctgga gcgcgaatgc gacgctggaa 
cagcgcacgg caagccttgc aggggcaggc 
aagggcggcg gtatcaacag cctggcgagc 
tcgctgaatc tggatgcgta tcactggaaa 
ctgaacttta agaaaacgtt ctaa 
<212> Type : DNA 
<211> Length : 2424 

SequenceName : SEQ ID 43 9 
Sequ.enceDe script ion : 

Sequence 



gccatcacct 
gttgacgtca 
gatgtggcga 
gtgggcacca 
acggtattcc 
ccgaaggtgg 
ctcggaaata 
agccagagca 
gcgggcgaca 
tccttcgaca 
agcagccgct 
cagtacctgg 
atcaccccgt 
aatgccggag. 
gtcgggctga 
ggcgggccga 
agccagcact 
gtcggcgtga 
gaggacggca 



caacgtccgt 
tcatgagcaa 
aggcactgga 
ctgctgaact 
gtgaagcgcg 
gcaatggcct 
acaccgagta 
tgagcctgga 
atggcgtaac 
atggtatgcg 
cggtcgctta 
agttgcgtag 
acgccggagt 
actttaacct 
aactggacta 
acctgagcta 
ttaacgtcga 
agtacagtag 
tcagcgacaa 



ggtatggacc 
aaacgcctac 
tgcgggttac 
gaatagcgcc 
tgtgttaagc 
ggcctttaac 
tgacatgctg 
atacggtatc 
cggcggctac 
ttggaacaac 
cggcgacgtc 
cgaaggggcg 
gaaactgcgt 
gagcatgaac 
cgcagggaaa 
cagcaagagc 
tgacggtcag 
caaagaaagt 
aggcgtgatg 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
240 0 
2424 



<213> OrgariisttiName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

30 atgcttaatg gaattagtaa cgctgcttct acactagggc ggcagcttgt aggtatcgca 60 

agtcgagtga gctctgcggg gggaactgga ttttctgtag cccctcaggc cgtgcgtctt 120 

actccggtga aagttcattc ccctttttct ccaggctcgt cgaatgttaa tgcgagaacg 180 

atttttaatg tgagcagcca ggtgacttca tttactccct ctcgtccggc accgccgcca 240 
ccgacaagtgr gacagg-catc cggggcatcc cgacctttac cgcccattgc acaggcatta . 3 00 

35 aaagagcact tggctgccta tgaaaaatcg aaaggtcctg aggctttagg ttttaagccc 3 60 

gcccgtcagg caccgccgcc accgacaagt ggacaggcat ccggggcatc ccgaccttta 420 

ccgcccattg cacaggcatt aaaagagcac ttggcfcgcct atgaaaaatc gaaaggtcct 480 

gaggctttag: gttttaagcc cgcccgtcag gcaccgccgc caccgacaag tggacaggca 540 

tccggggcat cccgaccttt accgcccatt gcacaggcat taaaagagca cttggctgcc 60 0 

40 tatgaaaaat cgaaaggtcc tgaggcttta ggttttaagc ccgcccgtca ggcaccaccg 660 

ccaccgacag ggcctagtgg actaccgccc cttgcacagg cattaaaaga tcatttagct 720 

gcctatgagc aatcgaagaa agggtaa 747 
<212> Type : DNA 
<211> Length : 747 

45 SequenceName : SEQ ID 440 

SeqixenceDescription : 



Sequence 

50 <213> Organ israName : Escherichia coli 0157:H7 
<4 00> PreSequenceString : 

atgaacaaga agattcattc cctggccttg ttggtcaatc tggggattta tggggtagcg 60 

caggcacaag agccgaccga tactcctgtt tcacatgacg atactattgt cgttaccgcc 120 

gccgagcaga acttacaggc gcctggcgtt tcgaccatca ccgcagatga aatccgcaaa 18 0 

55 aaccccgttg cccgcgatgt ttcggagatc atccgtacca tgccaggcgt taacctgacc 240 

ggtaactcca ccagtggtca gcgagggaat aaccgacaga ttgatattcg cggtatgggt 3 00 

ccggaaaaca cgctgatttt gattgacggc aagccggtaa gcagccgtaa ctcggtgcgt 3 60 

cagggctggc gtggcgagcg cgatacccgt ggtgatacct cctgggtgcc acctgaaatg 420 

attgaacgta ttgaagttct gcgtggtccg gcagctgcgc gttatggcaa cggcgcggcg 480 

60 ggcggcgtgg ttaacatcat taccaaaaaa ggcagcggcg agtggcacgg ctcctgggac 540 

gcatatttca atgcgccaga acataaagag gaaggtgcca ccaaacgcac taactttagc 600 

ctgaccggtc cgctgggcga cgaattcagc ttccgcttgt atggcaacct cgacaaaacc 660 

caggctgacg cgtgggatat caaccagggt catcagtccg cgcgtgccgg aacgtatgcc 720 

acgacgttac cagccgggcg cgaaggggtg atcaataaag atattaatgg cgtggtgcgc 780 

65 tgggacttcg caccattgca atcgctggaa ctggaagcgg gttacagccg ccagggtaac ' 840 

ctgtatgcgg gcgataccca gaacaccaac tctgacgctt acactcgctc gaaatatggc 900 

gatgaaacca accgcctgta tcgccagaac tactcgctga cctggaacgg tggctgggat 960 
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aacggcgtga ccaccagcaa ctgggtgcag 
gaaggtctgg cgggcggtac cgaagggaaa 
gafcaacgatc ttgatgacgt gatgctgcat 
gtfcaaccaga cgctgacgct gggtacggag 
5 tccaacaccc aggcactgac cggaacgaat 
accgaccgta gcccgtattc aaaagcagaa 
gagctgactg acagcaccat cgtaacgccg 
ggcaataact ggagcccggc gctgaacata 
aaaatgggca ttgcccgcgc ctataaagcg 

10 attctctaca gtaaaggtca gggctgctafc 
aacgatgacc tgaaagcaga aaccagcatc 
gacggctggc tggcgggcat cacctggttc 
ggctatgttg ctgtagggca aaacgcagtc 
... . ccgaaagcgg tggttgaagg tctggaagga 

15 atgtggacta ataacatcac ttatatgctg 
ttgtcgatca tcccggagta tacgttgaac 
ttcjtcgatgc aaacgacctt cacctggtac 
aaaggtcagc cagcggttgg accggaaacc 
ctg-agcgcga cgtgggatgt gacgaagaat 

20 ttcgacaaac gtttgtggcg tgcgggtaat 
aactatatcg ccggtgccgg ggcgtatacc 
agcgtaaaca ctcacttctg a 
<212> Type : DNA 
<2X1> Length : 2241 

25 SequenceName : SEQ ID 441 

SequenceDescription : 



tacgaacaca cccgtaactc gcgtattccg 1020 

tttaacgaaa aagcgacaca ggatttcgtc 1080 

agcgaagtta acctgccgat tgatttcctc 1140 

tggaatcagc agcggatgaa ggacttaagt 1200 

accggtggcg ctattgatgg cgtgagtgcc 1260 

attttctcgc tgtttgccga aaacaacatg 1320 

gggctgcgtt tcgatcatca cagtattgtc 1380 

tcgcaaggtt taggcgatga cttcacgctg 1440 

ccgagcctgt accagactaa cccgaactac 1500 

gccagcgcgg gcggctgcta tctgcaaggt 1560 

aacaaagaga ttggtctgga gttcaaacgc 1620 

cgtaacgatt atcgcaataa gattgaagca 1680 

ggcaccgatc tctatcagtg ggataacgtg 1740 

tcgttaaacg .taccggfctag cgaaapggtg 1800 

aagagtgaaa acaaaaccac gggcgaccgt 18 60 

tcaacgctga gctggcaggc acgagaagat 192 0 

ggtaagcagc agccgaagaa gtacaactat 19 8 0 

aaagaaatta gtccttacag cattgttggg 2040 

gtcagtctga ccggcggcgt ggacaatctg 210 0 

gcccagacca cgggcgattt ggcaggggcc 2160 

tataacgagc cgggacgtac gtggtatatg 222 0 

2241 



Sequence 



120 
180 



30 <2X3> OrganismName : Escherichia coli 0157:H7 
<4O0> PreSequence String : 
ttgggagggc gatttagtct caggtacaaa aaactctcat atagattcgt ttttctgacc 60 
ttggcaggtt gttcttcagt aggcaaccag tcattgaaaa atgagacgca ggaaagtgtg 
aaaaccaaaa ttgttaaagg caaaactaca aaacaggacg tgttagcatc gttcggtgaa 

35 cctgacagcc gttctttgat cgatggtgaa gaacaatggt catacactat gtataacagc 240 
cagtccaaag caacctcttt catccccgtt gtgggactgc ttgcaggtgg cgcagactca 300 
caaactaaat ctctgacagt ttctttcaaa ggcgaaaaag tcagcacata catctttaat 360 
gctggaacaa gcaacgtgaa gactggcatt ttttag 396 
<2X2> Type : DNA 

40 <2X1> Length : 396 

SequenceName : SEQ ID 442 
SequenceDescription : 



Sequence 
45 

<213> OrganismName : Escherichia 
<4O0> PreSequenceString : 
atgaaaaaaa ttgcatgtct ttcagcactg 
tccgtagctg cgacttctac cgtaactggc 

50 atgaacaaaa tgggcggttt caacctgaaa 
ggtgtgatcg gttcattcac ttacaccgag 
aacaaaaacc agtactacgg catcactgct 
agcatctacg gtgtagtggg tgtgggttat 
tacaaacacg acaccagcga ctacggtttc 

55 atggaaaacg ttgctctgga cttctcttac 
ggcacctgga ttgccggtgt tggttaccgc 
<2X2> Type : DNA 
<2X1> Length : 516 

SequenceName : SEQ ID 443 

60 SequenceDescription : 



coli 0157 :H7 

gccgcagttc tggctttcac cgcaggtact 60 

ggttacgcac agagcgacgc tcagggccaa 120 

taccgctatg aagaagacaa cagcccgctg 18 0 

aaaagccgta ctgcaagctc tggtgactac 240 

ggtccggctt accgcattaa cgactgggca 3 00 

ggtaaattcc agaccactga atacccgacc 360 

tcctacggtg ctggtctgca gttcaacccg 42 0 

gagcagagcc gtattcgtag cgttgacgta 480 

ttctaa " 516 



Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 
65 <4O0> PreSequenceString : 

atgaagagta tagcaacact ggttgtgtgt gcaatctccg ggattgcctg tgtaaattta 60 

tctgcacatg cagcagaagg agagcataca atttctctgg ggtatgcgca ctttcagttt 12 0 
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ccgggactga aggattttcgt aaaggatgcg actgctcata acagggagac tttcagtcat 180 

ttcgtcaaca gaaactactt ttcttcattg ggcgaatata cagatggtcg ggtcagtgga 240 

tatgaaggca aggataaaaa tccacagggc attaatatca ggtatcgcta cgagataacg 3 00 

gatgattttg gcgttatcsic ctcttttaca tggacgcgtt ctctcactaa ctcacagaca 3 60 

5 tttattgatg tgcagtcagc cgatcatacc aggaagatta agaatccggc agcttctgcc 420 

agaacggata tcagggcgaa ttactggagt ctgttagcgg ggccttcatg gcgggttaat 48 0 

cagtacatga gtttatatcjc gatggcaggg atgggcgttg ctaaagttag cgctgacctg 540 

aaaattaagg acaatattaa cagtagtggc ggattttctg aaagcaacag cacgaaaaaa 600 

acctcccttg cgtgggctgc aggtgcacag tttaacctga atgagagtgt tacactggat 660 

10 gtggcttacg aaggttccgg ctctggcgac tggcgcacga gtggcgttac tgctggcatt 720 

ggcctgaaat tctga 735 
<212> Type : DtfA 
<211> Length : 73 5 

SequenceName z SEQ ID 444 

15 SequenceDe script ion : 

Sequence 

<213> OrganismName = Escherichia coli 0157:H7 

20 <400> PreSequenceStaring : 

atgcgtaaac tttatgccgc cattttgtcc gcagccattt gtctgaccgt atccggtgcg 60 

cctgcatggg cgtctgagca gcaggccacg ctgagcgcgg ggtatcttca tgtctcgacg 120 

aacgctcccg gtagcgataa tcttaacggg attaacgtga aataccgtta tgaattcacg 180 

gacacgctgg ggctggtgaic gtcattcagc tatgcaggag acaggaatcg ccagattacc 240 

25 cgttacagcg atacccgctg gcatgaagat tccgtgcgta accgctggtt cagcgtaatg 3 00 

gcggggccgt ctgtgcgcgt gaatgaatgg ttcagcgcgt atgcgatggc gggagtggct 3 60 

tacagccgtg tgtcgacttt ctccggggat tatctccgcg taactgacaa caaggggaaa 420 

acgcacgatg tgctgaccgg aagtgatgac ggtcgccaca gcaacacgtc tctggcgtgg 480 

ggggctggcg tgcagtttaa cccgaccgaa tccgtggcca ttgatattgc ttatgaaggc 540 

30 tccggcagtg gcgactggcg cactgacggt ttcatcgtgg gtgtcggtta taagttctga 600 

<212> Type : DMA 
<211> Length = 60 0 

SequenceName z SEQ ID 445 
35 SequenceDescription : 

Sequence 

<213> OrganismEFame z Escherichia coli 0157 :H7 

40 <400> PreSequenceS tiring : 

atgcgtaaac tttatgccgc cattttgtcc gcagccatct gtctggccgt atccggcgcg 60 

cctgcatggg cgtctgagca gcaggccacg ctgagcgcgg ggtatcttca tgcccggacg 120 

agcgctcccg gtagcgataa tcttaacggg attaacgtga aataccgtta tgaatttacg 180 

gacacgctgg ggctggtgsc gtcattcagc tatgcaggag acaagaatcg ccagcttacc 240 

45 cgttacagcg atacccgctg gcatgaagat tccgtgcgta accgctggtt cagcgtaatg 3 00 

gcggggccgt ctgtgcgcgt gaatgaatgg ttcagcgcgt atgcgatggc gggtgtggct 3 60 

tacagccgtg tgtcgacttt ctccggggat tatcttcgcg taactgacaa caaggggaaa 420 

acgcacgatg tgctgaccgg aagtgatgac ggtcgccaca gcaacacgtc tctggcgtgg 480 

9999 ct 99 c 9 tgcagtttaa cccgaccgaa tccgtggcca ttgatattgc ttatgaaggc 540 

50 tccggcagtg gcgactggcg cactgacggt ttcatcgtgg gtgtcggtta taagttctga 600 

<212> Type : DNA 
<211> Length : 600 

SequenceName = SEQ ID 446 
55 SequenceDescription : 

Sequence 

<213> Organ ismName = Escherichia coli 0157:H7 

60 <400> PreSequenceS taring : 

atgcgtaaac tttatgccgc cattttgtcc gcagccatct gtctggccgt atccggcgcg 60 

cctgcatggg cgtctgagca gcaggccacg ctgagcgcgg ggtatcttca tgcccggacg 120 

agcgctcccg gtagcgataa tcttaacggg attaacgtga aataccgtta tgaatttacg 180 

gacacgctgg ggctggtgac gtcattcagc tatgcaggag acaagaatcg ccagcttacc 240 

65 cgttacagcg atacccgctg gcatgaagat tccgtgcgta accgctggtt cagcgtaatg 3 00 

gcggggccgt ctgtgcgcgt gaatgaatgg ttcagcgcgt atgcgatggc gggtgtggct 360 

tacagccgtg tgtcgacttt ctccggggat tatcttcgcg taactgacaa caaggggaaa 420 



WO 2005/076010 



146/341 



PCT/IN2005/000037 



acgcacgatg tgctgaccgg aagtgatgac ggtcgccaca gcaacacgtc tctggcgtgg 480 
ggggctggcg tgcagtttaa cccgaccgaa tccgtggcca ttgatattgc ttatgaaggc 540 
tccggcagtg gcgactggcg cactgacggt ttcatcgtgg gtgtcggtta taagttctga 600 

<212> Type : DNA 
<211> Length : 600 

SequenceName : SEQ ID 447 

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<4 00> PreSequenceString : 

atgcgtaaac tttatgccgc cattttgtcc gcagccatct gtctggccgt, atccggcgcg $0 

cctgcatggg cgtctgagca gcaggccacg ctgagcgcgg ggtatcttca tgcccggacg 120 

agcgctcccg gtagcgataa tcttaacggg attaacgtga aataccgtta tgaatttacg 180 

gacacgctgg ggctcrgtgac gtcattcagc tatgcaggag acaagaatcg ccagcttacc 240 

cgttacagcg atacccgctg gcatgaagat tccgttcgta accgctggtt cagcgtaatg 300 

gcggggccgt ctgtgcgcgt gaatgaatgg ttcagcgcgt atgcgatggc gggtgtggct 3 60 

tacagccgtg tgtcgacttt ctccggggat tatcttcgcg taactgacaa caaggggaaa 420 

acgcacgatg tgctgaccgg aagtgatgac ggtcgccaca gcaacacgtc tctggcgtgg 480 

ggggctggcg tgcagtttaa cccgaccgaa tccgtggcca ttgatattgc ttatgaaggc 540 

tccggcagtg gcgactggcg cactgacggt ttcatcgtgg gtgtcggtta taagttctga 600 

<212> Type : DNA 
<211> Length : 6 00 

SequenceMame r SEQ ID 448 

SequenceDe script ion : 

Sequence 



<213> OrganismlsFame : Escherichia coli 0157:H7 
<400> Pre&equeaceString z 

atggtcatga gccagaaaac cctgtttaca aagtctgctc tcgcagtcgc agtggcactt 60 

atctccaccc aggcctggtc ggcaggcttt cagttaaacg aattttcttc ctctggcctg 120 

ggccgggctt afctcagggga aggcgcaatt gccgatgatg caggtaacgt cagccgtaac 180 

cccgcattga ttaccafcgfct tgaccgcccg acattttctg cgggtgcggt ttatattgac 240 

ccggatgtaa atafccagcgg aacgtctcca tctggtcgta gcctgaaagc cgataacatc 300 

gcgcctacgg catgggttcc gaacatgcac tttgttgcac cgattaacga ccaatttggt 360 

tggggcgctt ctattacctc taactatggc ctggcaacag agtttaacga tacttatgca 42 0 

ggcggctctg tcgg-gggtac aaccgacctt gaaaccatga acctgaactt aagcggtgcg 48 0 

tatcgcttaa ataatgcatg gagctttggt cttggtttca acgccgtcta cgctcgcgcg 54 0 

aaaattgaac gtttcgcagg cgatctgggg cagctggttg ctggtcagat tatgcaatct 600 

cctgccggga agactcctca agggcaagca ttggcagcta ccgccaacgg tatcgacagt 66 0 

aataccaaaa tcgctcatct gaacggcaac cagtggggct ttggatggaa cgccggtatc 720 

ctgtatgaac tggataaaaa taaccgctat gcactgacct accgttctga agtgaaaatt 780 

gacttcaaag gtaactacag cagcgatctt aatcgtgtgt ttaataacta cggtttgcca 840 

attcctaccg ccacaggtgg cgcaacgcaa tcgggttatc tgacgctgaa cctgcctgaa 90 0 

atgtgggaag tgtcgggtta taaccgtgtt gatccgcagt gggcgattca ctatagcctg 960 

gcttacacca gctggagtca gttccagcag ctgaaagcga cctcaaccag tggcgacacg 102 0 

ctgttccaga aacatgaagg ctttaaagat gcttaccgca tcgcgttggg taccacttat 1080 

tactacgatg ataactggac cttccgtacc ggtatcgcct ttgatgacag cccagttccg 1140 

gcacagaatc gttctatctc cattccggac caggaccgtt tctggctgag tgcaggtacg 1200 

acttacgcgt ttaataaaga tgcttcagtc gacgttggtg tttcttatat gcacggtcag 1260 

agcgtgaaaa ttaacgaagg cccataccag ttcgagtctg aaggtaaagc ctggctgttc 1320 

ggtactaact ttaactacgc gttctga 1347 
<212> Type : DN"A 
<211> Length : 1347 

SequenceName : SEQ ID 449 

SequenceDe script ion : 

Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequerxceString " : 

atggcctttt ctcaagcggt tagcggafcta aacgctgccg ccaccaacct cgatgttatt 60 
ggcaacaata tcgccaactc cgccacctac ggctttaaat caggcacggc ctcttttgcc 120 
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gatatgtttg ccggttcgaa 

tttaccgatg gcacgaccac 

ggttttttcc gtctggtaga 

aagctggatg aaaatcgtaa 
5 gcaaccggta cgccgccgac 

aataccctga tggcagcgaa 

agcgatccgc ttccctctcjt 

aaaggttcgg tgactgtttt 

gtgaagaccg gggataataa 
10 acagccgagc ctgcaatgaa 

acagagaata ttaccaccgg 

ttcctcaact ccatgcagca 

ggctacaaac cgggcgatct 

aactattcca acgaacaaac 
15 aacgaaggtc tggcatcccja 

gcgctgttgg ggacagccgg 

gcgtccaacg tcgatctcag 

cagtctaacg cccagaccat 

cgctaa 
20 <212> Type : DNA 

<211> Length : 1206 

SequenceName : SEQ ID 450 
SequenceDescription : 

25 Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceS taring : 

atgtctaaat cgactttttt acacattctg attagctcca tcatattggt ggcactaata 60 

30 caatcatccg cttgggctaa ctgtacaaac acacagatag gtcaaactga agatggtcgg 120 

acagcgctta ttgagttccjg aaaaattaat atgaccgaca cctattttgc gccagcaggt 180 

tcactcctcg cgacaacggt cgtacctccc actaactaca catcaggtgg cgcgacagga 240 

agctccgtat tgtgggaatg tgatgcaaca gatttgccaa acatctattt tctggtcgct 3 00 

accaatggcg acgaccgcgt ggggggcttt tacgatgcag gcgggcctga tggtctgagt 360 

35 gatgtctatg ccacctggtt cgcttttgtc ggtctcaagc agaccatggc gggcgtgacg 420 

cttggtcgtt actggaagaa agtgcccatc accagttatg ccactcaggg aactaaaatc 480 

cagattcgct tacaggatat ccctcctctt catgctgagc tttatcgcat cagtacgcta 540 

cctgatacat cagcaacaac aagttggtgc ggtaataata atacagatag tagtggagtc 600 

ggattcgcaa aaccttccgg tacaatctat aactgtgttc agcccaatgc ctatattcag 660 

40 ctttccggta ccagcggcat tttatttggt catgatgagc ccggcgaaga tagttctgtt 720 

cattgggatt tctggggtgc tgataatggt tttggttacg gaatgcgttc ggccaatcga 780 

ctctacaaca atgccacctg cgttgcccgc agcgccacgc cgttagtatt gctgccgaca 840 

attgcagaag cacaactgaa tgcgggcatg gaaagtaccg gtaattttaa tgtccgcgtc 900 

gagtgtagta actcggttca atcagggatt agcgatactc agacagcatt aggaatccag 960 

45 gtgtctgaag gtgcatatac agcggcgcaa aaactgggga ttatcaatag caacggcggc 1020 

gtcagcgccc tggtctctga taattatgac gcagcagaga tggcaaaggg cgttgggatc 1080 

tacatttcta acagtgctca ccccgatacg gcgatgacgc tggttggtca accgggcatc 1140 

gcgaagttaa cccccggagg aaatgcagcg gggtggtatc ctgtatttga aggggcaaca 1200 

ttagaaggtg cgactcaccc cggatactcc agctatagtt actcttttat cgcccggttg 12 60 

50 aagaaactgc caaatcagac agtcagtgcg ggaaaagtgc gggcaacggc ttatatattg 1320 

gtgaaaatgc aatga 13 35 
<212> Type : DNA 
<211> Length : 133 5 

SequenceName : SEQ ID 451 

55 SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli 0157:H7 

60 <400> PreSequenceS t ring : 

atggaaaaca atcgcaattt ccctgccaga caatttcatt cgctcacgtt ctttgccggt 60 

ctttgtattg gcatcacgcc tgtggctcag gcactcgccg ccgaagggca aactaacgcg 120 

gatgacacgc tggttgtcga agcatcaacg ccttcgcttt atgcgccaca acaatctgcc 180 

gatccgaaat tctcgcgtcc ggtagcggat actacccgca cgatgacggt gatttctgaa 240 

65 caagtgatta aagatcaggg cgcaaccaac cttaccgatg cgctcaaaaa cgtccccggc 3 00 

gtgggtgcgt tttttgcggg tgagaacggt aactccacca ctggcgacgc catttatatg 3 60 

cgcggtgctg atacctctaa cagtatttat attgatggca ttcgcgatat cggcagcgtc 420 



agtgggactg ggggtaaaag ttgccggtat cactcaggac 180 

caacaccggg cgtggtctgg acgttgctat cagccagaac 240 

tagcaacggt tcggtgttct acagccgtaa cggacaattt 3 00 

cctggtgaat atgcaaggtt tacagctgac gggttacccg 360 

tattcagcaa ggggcgaatc cgactaacat ttcgatcccg 420 

aactaccacc acggcgtcga tgcagatcaa cctgaattcc 480 

taacgcattt gatgccagca atgcggatag ctataacaaa 540 

cgacagtcag ggtaatgctc atgacatgag cgtctacttt 60 0 

ctggcaggtc tacacccagg atagcagtga tccaacaggt 660 

gctggtgttt aatgccaatg gcgttctgac ctcaaatcca 72 0 

cgcaattaac ggcgcagaac ccgccacgtt tagcctgagc 780 

aaataccggc gctaacaaca ttgtggcaac cacccagaat 840 

ggtgagttat caaatcaatg atgacggtac ggttgtcggc 900 

ccaactgctg gggcagattg tactggcgaa^ctttgccaac 960 

aggcgacaac gtctggtctg cgacgcaatc ttctggcgtg 1020 

gacgggcaac tttggcaccc tgaccaacgg tgcgttggaa 108 0 

taaagaactg gtcaatatga tcgttgccca gcgtaactat 1140 

caaaacccag gaccagatcc tcaacacgct ggttaactta 12 0 0 

1206 
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tcgcgcgaca ccttcaatac tgagcaggtc 
tacgggcgca gcgcgccgac gggctcgatc 
tccggcattg acgcctccgc cagtattggc 
gtcaatcagg tcattggtga taccactgcg 
5 cacgatgccg gacgcgacaa agtcaaaaat 
tttggccttg gtacagcgaa tcgtttgtat 
acgccagacg gcggcattcc aaccatcggt 
acggcgaccc tgaatcattc cggaaaagtt 
gattacgacg attcgaccac cgacaccgcc 

10 aacaccacca ttcgcaacac tacccgttgg 
gcgattatgg gtggggcgtc gaatattacc 
tggtcacgca cggcgaatac caaagatgtg 
ctgacctcga cattctatac cgcttctatc 
acccgtgaaa cgcagactaa ctacggcgtt 

15 tatcatcctg acagcagcat tcatcccggc 
ggtcagacgg ataccttcgc aatttacgca 
gagctgaacg gcgggatccg tctggataat 
tgcggcggca gcggacgcgg tgccatcacc 
gtcaccaccg tcgacaccgc caagtcgggc 

20 tatcacctga cggaaaacgg caatgtctat 
ggcggcaaca acttcgccct tgcgcagtct 
tttaaaccgc aaaaagccaa caccagcgag 
cgcctgttgc tcaccgccgc gctgttccgt 
gatgacggga cttactcgca atacggtaag 

25 gcagggaata tcactcccgc gtggcaggtg 
atcaaaaacg gcaaagatgt tgcccaggat 
cacgccttca ccttatggag ccaatatcag 
gcacgctata tcggcagtat gcataaaggt 
accgaaggtt actgggtcgc cgatgccaaa 

30 ttccagctaa acgtctacaa cctgfcttgat 
ggctaccgtt atcacccggg cgagccaaga 
tga 

<212> Type ; DNA 
<211> Length : 2283 
35 SequenceName : SEQ ID 452 

SequenceDescription : 

Sequence 



40 <213> Organ ismName : Escherichia 
<400> PreSequenceString : 

atgcaaatga agaaattgct ccccattctt 

ttgagccagg ccgagaacct gatgcaagtt 

ttgcgtaagt ctgccgccga tcgtgatgct 

45 ccattactgc cacagctagg tttaggtgca 

gcgaacggca tcaactctaa cgcgaccagt 

gatatgtcga aatggcgtgc gttaacgctg 

acgtatcaga ccgatcagca aaccttgatc 

ttgaatgcta ttgacgttct ttcctataca 

50 ttagatcaaa ccacccaacg ttttaacgtg 

gcccgcgcgc agtacgatac cgtgctggcg 

aacgcggtag agcagctgcg ccagatcacc 

aatgtcgaaa actttaaaac cgacaaacca 

gaaaaacgca acctgtcgct gttacaggca 

55 attcgccagg cgcaggatgg tcacttaccg 

tctgacacct cttatagcgg ttcgaaaacc 

agcaatatgg gccagaacaa agttggcctg 

atggttaact cgcaggtgaa acaggcacag 

gaaagcgcgc atcgtagcgt cgtgcagacc 

60 tctatcagta gtattaacgc ctacaaacaa 

gcgatggaag cgggctactc ggtcggtacg 

accacgctgt acaacgccaa gcaagagctg 

cagctgaata ttaagtcagc cctgggtacg 

aatgcgctga gcaaaccggt ttccactaat 

65 cagaatgcta ttgctgatgg ttatgcgcct 

tccgcacgca ctaccaccag taacggtcat 

<212> Type : DNA 



gaagtgatta aagggccttc cggcaccgac 480 

aatatgatca gcaagcagcc gcgcaatgat 540 

agcgcctggt tccgccgcgg cacgctggac 60 0 

gtgcgcctga atgtgatggg cgaaaaaacg 660 

gagcgttacg gcgtcgcccc tfcctatcgct 720 

cttaattatc tgcatgtcac ccagcacaac 78 0 

ttgccgggct attctgcccc atctgcagga 840 

gatacgcata acttttacgg cacggatfccc 90 0 

accatgcgtt ttgagcacga tatcaacgat 960 

tcgcgcgtga agcaggatta cctgatgacg 102 0 

cagcccacca gcgatgtgaa tagctggacg 1080 

agtaataaaa ttctcaccaa ccagaccaac 1140 

ggtcatgatg tcagtaccgg cgtggaattt 1200 

aatccggtga cgttacctga ggtaaatatt 1260 

cjgcctgacgc gcaacggcgc aaacgccaat 1320 

ttcgatacgc tgcaaatcac ccgtgatttt 13 80 

tatcatactg aatatgacag tgccaccgcc 1440 

tgcccagctg gtgtggcaaa aggttctccg 150 0 

aatctggtga actggaaagc cggggcgctg 1560 

attaactatg ccgtttccca gcagcctccg 1620 

cjgcagcggta acagtgccaa ccgcaccgat 1680 

attggcacca aatggcaggt tctggataaa 1740 

actgatatcg aaaatgaagt tgagcaaaat 1800 

aaacgcgtcg aaggctatga gatatccgtg 18 60 

attggcggct atacccagca aaaagcaacc 1920 

ggttcctcat cgctgccgta taccccggag 1980 

cjcaaccgatg atatctctgt tggcgcgggc 2040 

-fccagacggcg cggtgggaac gccagcgttt 210 0 

ctggggtata gggttaafccg caatctcgac 2160 

accgattacg tcgcctcaat caataagagc 2220 

accttcttgc tcacagccaa tatgcatttc 2280 

2283 



coli 0157:H7 

atcggcctga gcctttctgg gttcagttcg 60 

tatcagcaag cacgccttag taacccggaa 120 

gcctttgaaa aaattaatga agcgcgcagt 180 

gattacacct atagcaacgg ctaccgcgac 240 

cjcgtccctgc agttaactca atccattttt 3 00 

caggaaaaag cagcagggat tcaggacgtc 360 

ctcaacaccg cgaccgctta tttcaacgtg 420 

caggcgcaaa aagaagcgat ctaccgtcaa 480 

ggcctggtag cgatcaccga cgtgcagaac 540 

aacgaagtga ccgcacgtaa taaccttgat 600 

ggtaactact atccggaact ggcggcgctg 660 

cagccggtta acgcgctgct gaaagaagcc 720 

cgcttgagcc aggacctggc gcgcgagcaa 78 0 

actctggatt taacggcttc tagcgggatt 840 

cgtggtgccg ctggtaccca gtatgacgat 900 

agcttctcgc tgccgattta tcagggcgga 960 

tacaactttg ttggtgccag cgagcaactg 1020 

gtacgttcct ctttcaacaa cattaatgct 1080 

grccgtagttt ccgctcaaag ctcattagac 1140 

cgtaccattg ttgatgtgtt ggatgcgacc 12 0 0 

gcgaatgcgc gttataacta cctgattaat 12 60 

ttgaacgagc aggatctgct ggcactgaac 13 20 

ccggaaaacg ttgccccgca aacgccggaa 13 80 

gatagcccgg cacccgtcgt tcagcaaaca 1440 

aaccctttcc gtaactga 1488 
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<211> Length : 1488 

SequenceName : SEQ ID 453 
SequenceDescriptioxi : 



Sequence 

<213> Organ isralSTame : Escherichia coli 0157:H7 
<400> PreSequenceString : 

gtgaccaaac tcaaacttct ggcacttgga gtgcttatcg caacgtctgc aggcgtagcg 60 
cacgctgaag gtaaattttc cctgggcgca ggcgtaggtg tcgttgaaca cccatataaa 120 
gattacgata ccgatgttta cccagtaccg gtaatcaact atgaaggcga taacttctgg 180 
ttccgtggct taggtggtgg ttactacctg tggaatgacg caacggataa actttcaatt 240 
accgcttact ggtcgccgct ttacttcaaa gctaaagaca gtggcgatca ccaaatgcgt 3 00 

cacctggatg accgtaagag cacaatgatg gctggtctgt. cttatgctca ctttacccag 360 
tacggttacc tgcgtaccac cctggctggc gataccctgg ataacagcaa cggcatcgtc 42 0 

tgggatatgg cctggttgta tcgttacacc aacggtggcc tgaccgtgac tccgggtatt 480 
ggtgtgcagt ggaacagcga aaaccagaac gaatactatt atggcgtatc gcgcaaagag 540 
tccgctcgca gcggtctgcg tggctataac tcgaacgaca gctggagccc ttacctggag 600 
ctgagcgcca gctacaactt cctcggcgac tggagtgttt acggtaccgc gcgctacacc 660 
cgtctgtctg atgaagttac tgacagcccg attgtggata aatcctggac tggcctgatt 720 
tctaccggga tcacctacaa attctga 747 
<212> Type : DNA 
<211> Length : 747 

SequenceName : SEQ ID 454 

SequenceDescription : 



Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

atgaaaaaaa cattactggc agccggtgcg gtactggcgc tctcttcgtc ttttactgtc 60 

aacgcagctg aaaacgacaa accgcagtat ctttccgact ggtggcacca gagcgttaac 120 

, gttgtcggaa gctatcacac ccgfcttcgga ccgcagatcc gcaacgatac ctaccttgag 180 

tacgaagcat tcgctaaaaa agactggttc gacttctatg gttatgcgga tgcgccggta 240 

ttcttcggcg gtaactccga tgcaaaaggt atctggaacc acggttctcc tctgtttatg 3 00 

gaaatcgaac cacgtttctc catcgacaag ctgaccaata ctgaccttag cttcggtccg 3 60 

ttcaaagagt ggtacttcgc gaacaactac atttacgaca tgggtcgtaa taaagatggt 420 

cgccagagca cctggtacat gggfcctgggt accgatatcg acactggcct gccgatgagc 480 

ctgtccatga acgtctatgc gaaataccag tggcagaact atggcgcagc gaacgaaaac 540 

gagtgggacg gttaccgttt caaaattaaa tactttgtgc cgattaccga tctgtggggc 600 

ggtcagctga gctacatcgg cttcaccaac ttcgactggg gttccgattt aggggatgac 660 

agcggtaacg caatcaacgg tattaagacc cgtactaata actctatcgc ttccagccat 720 

attctggctc tgaactacga tcactggcac tactctgtcg tagctcgtta ctggcacgac 780 

ggtggtcagt ggaacgacga tgcagaactg aacttcggca acggcaactt caacgttcgc 840 

tctaccggct ggggtggtta cctggtagta ggttacaact tctga 8 85 
<212> Type : DNA 
<211> Length : 885 

SequenceName : SEQ ID 455 

SequenceDescription : 



Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 
<4 00> PreSequenceString : 

atgctttcaa cacaatttaa cagcjgataat caatatcaag ccatcaccaa accgtcacta 60 

cttgccggtt gcatagcact ggcactatta ccttctgccg cttttgctgc accagccact 120 

gaagaaacgg tgattgttga ggcjttcagcc acagctccag atgatggcga aaatgattac 180 

agcgtaacgt ctacctctgc gggtaccaaa atgcagatga ctcaacgtga tattcctcag 240 

tcggtcacta ttgttagcca gcagcggatg gaagatcagc agttacaaac gctgggcgaa 300 

gtgatggaaa acacgctggg gatcagcaaa agtcaggcgg attccgatcg tgctctttat 3 60 

tattcccgcg gattccagat cgataactat atggttgatg gtatccccac ctattttgaa 420 

tcgcgctgga atctgggcga cgcactttct gatatggcac tgtttgaacg cgtagaagta 480 

gtgcgtggcg cgacaggact catgaccggg acgggtaatc catctgcggc aattaatatg 540 

gttcgaaaac acgcgaccag tcgtgaattt aaaggcgatg tctcggcgga atacggtagc 600 

tggaacaaag aacggtatgt ggcggattta caaagcccac tcaccgaaga cggtaaaatc 660 

cgcgcgcgaa ttgtcggcgg ctaccagaat aacgactcat ggctggaccg ctacaatagt 720 

gaaaagacct tcttctccgg cattgtcgat gctgatttag gcgatcttac gacgctttca 780 
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50 



55 



60 



gccggttacg aatatcagcg cattgatgtt 
tggaatactg atggcagcag caacagttac 
gcgtacaacg ataaagagat caacaaggtc 
acctggcaag cgacactgaa tgccacccac 
tatgtcgatg cctatgtaaa caaagcggat 
ggacctggct ttgattatgt cggcggcacc 
gcgctggatt tgttcgctga cggtagttat 
tttggtggca gttacagcaa acaaaacaat 
ccggatgaaa ttggcagttt ctacaacttt 
ccacagagcc tggcgcagga cgataccaca 
gtcacccttg ccgatccgct gcatctgatc 
gatacgctga cttacagcat ggagaaaaac 
gacatcaatg acaactggtc gacctacgcc 
gatcgtgaca gttcaggcaa atatctggct 
ctgaaatcgg actggatgaa tagccgtctg 
caggataatg tcgctcagtc caccggtaca 
tataaagcgg tggatgggac agtcagtaaa 
accgacaact ggcagctgac atttggcgca 
aacgccgtta atcctaatct gccacgcacc 
cctgtcatgc cagagttgac agtcggcggt 
gacaccgtga caccgtatgg caccttccgc 
ctcttcaccc gctaccaggt gacgaaaaac 
ttcgacaaaa cctacgatac caacgtggaa 
ttcagcatta ccggcacgta tcaattctga 
<212> Type : DNA 
<211> Length : 2190 

SequenceName : SEQ ID 456 
SequenceDescription : 

Sequence 



aatagcccaa 
gatcgcgcac 
tttatgaccc 
tctgaagtcg 
ggtatgctgg 
ggttggaaca 
gaattgtttg 
cgttacttca 
aatggcaatt 
catatgaaat 
ctcggcgcac 
cacaccacgc 
agctatacct 
ccaatcaccg 
accaccacgt 
cctatccccg 
ggggtggaat 

acgcgctata 
acggttaaaa 
ggtgttaact 
gccgagcaag 
ttctcgttac 
ggttctatcg 



cctggggcgg 
gcagtaccgc 
tgaagcagcg 
aatttgacag 
ttgggccata 
gtggcaaacg 
gfccgtcagca 
gttcatgggc 
tcccacaaac 
cgttatatgc 
gttataccaa 
cttacgctgg 
ctattttcca 
gtaa.caacta 
tagccatctt 
gcagcaacgg 
ttgaactcaa 
ttgcagagga 
tgttcaccag 
ggcaaaatcg 
gtagctacgc 
aggggaacgt 
tctacggcgc 



tttaccgcgc 
acctgactgg 
gtttgctgat 
caaaatgatg 
cagtaattat 
taaagttgat 
caatctaatg 
caacatcttc 
cgactggtca 
tgccactcgt 
ctggcgggtt 
tctggtgttt 
gccgcaaaat 
cgagctggg-t 
ccgtattgag 
cgaaaccgcc 

cggcgcaatt 

taacgaagga 
ctatcggttg 
cgtgtatacc 
gttggtggat 
caataacctg 
accgcgtaat 



<213> Organ ismName : Escherichia 
<400> PreSequenceString : 
atggcgcgat tccagtttaa aaaccgtaaa 
atggtgatgg gagaagctgc aattgctgca 
gtaacgccag ttgcccaatt atctttacag 
cccggtgtca ctcaacaggc ggcgcagata 
aaaagtgcct ggtatccgca cgttggatta 
gactccagcg gcaggcttga taacaacgtt 
tatgactttg gtaaaactaa caacgatatc 
cgctttaaat tgatggcaac cttaaccgat 
gaagtcagtc gttatcaggc tttgtgcgat 
aacgtctaca acatggcggc attgcgcgct 
ttacaggccc agacgcgtat tgccggaatg 
atggcgagcg ccaaagcgca actggcggtg 
gcgccacctg ctgaattagc cgagcagccg 
atcccgctgg tgctggcggc agaaaactta 
acgaaagcgc aatactggcc aacgctcagt 
agcgaccgct cgtattggga tgatcagcta 
ggcggcgcgg tttctgccca ggtgcaacag 
caggtcgaac aggcaaaact ggatgtgctg 
accggcgcac ggggtcgtga agaagccggt 
cgagatgtgt accaaaatga atataaattg 
gtcgaacaag atgtctttca ggcgcaatcg 
gtcgccgccg taaattatgc cgctgcggtg 
cagggcttat acaacgactt acccgacttg 
<212> Type : DNA 
<211> Length : 13 56 

SequenceName : SEQ ID 457 
SequenceDescription : 



coli 0157:KC7 



aataatggac 
ccgctgccac 
gaaagtatat 
ggtattggcg 
accggcaacg 
tcgtatggca 
aatctgcaaa 
gttgcagaga 
gcggctcaac 
aacgcaggcc 
cgctcaacgc 
ctcactggcg 
gtatcgctga 
cgccagtcag 
attcaggggg 
caactgaacg 
gccgaagggc 
caacgagcgt 
ttagcgcaat 
ggtaaacgca 
gctgaaataa 
aataacctta 
aaataa 



840 
900 
960 
1020 
108 0 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
162 0* 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2190 



ttattttttt 


tataagtttt 


60 


aatgggcgaa 


tgccccagcg 


120 


tacgcgcctt 


tgcgcgaaat 


180 


aagcgcaaat 


tgatgaagcc 


240 


cggggccgtc 


ccgacaaacg 


300 


taaccctgac 


acaactggtg 


360 


ctgccgcccg 


tgacagctac 


420 


aaacggcgac 


fcgcctacatg 


480 


gcaatattca 


ctcgctggaa 


540 


tgaactcatc 


gtcggatgaa 


600 


tggagcaata 


tcaggcgcag 


660 


tacagccgga 


ggcgatagcc 


720 


agaatattga 


ttaccagtcc 


780 


cacagtacgg 


cgtggaaaaa 


840 


gtaaaacgcg 


ctaccagacc 


900 


ttaacgcgcc 


gctgtatcag 


960 


aacaaaaaat 


ctctgcctcg 


1020 


ctgtggcata 


tgcgaactgg 


1080 


ccgaaagtgc 


gcacaaaacg 


1140 


gtctgaatga 


tctattaacc 


1200 


atgccaatta 


tgatggctgg 


1260 


ttccattagc 


gggaattaaa 


1320 






1356 



Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 
65 <4 00> PreSequenceString : 

atggctaagt tcacaccttc attctcagga atcaaaggtc gggcgctctt ttcactgctc 60 
tttgcagcac caatgattca tgcaaccgac actgcaacga ccaaagatgg cgaaacaatc 120 
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40 



actgttacag cggatgcaaa taccgcaact 
acttccacgg cgacattaac cgatatgccg 
gttagcgatc aggttctgga aaatcagaat 
gtcagtaacg tggtacagac caatacatta 
ggttttggtg ctaaccggga tggctccatc 
cgcagtttca acgccgccac agaacgtgtg 
tatggcattc tcgatcctgg tggattgatt 
ttccatggtt ctgtctcagc cacgtcctcc 
atcacaggtc ccattgaagg cactcagctg 
gaagattact ggcgaaactt cggtaaagag 
tggtttggtg ataatgcaac agtaaccatg 
ttcgatcgtg gaacgatttt cgaccttacg 
atacgttttg acgaaccgtt taatattaca 
gcagaatatc „ atctcaatag ccagtggaca 
aaatacagcg acaatcaggc tcgcgttacc 
cggcgtgttg atgcaactca gggatctacc 
caagggaatg ttgatattgc tgggttctat 
tattatgatc ttctgcgcac agatatgatt 
tacaaccccg tttatggcaa taccagcaaa 
cagacgatca aacaggagag ctactcagct 
aactggattg ccgtcgccgg gatccgctat 
cgtcctttta atgtcaatac tgacagccgc 
gtctacaaac tgacgccatc ggtatcctta 
caatcgtcaa ttgccagcta catcggagat 
gtcggggcaa aattcgagct gttcgatggt 
cataaacgta acgtgttgta taccgaaagt 
ggccgcgttc gttcaagagg ggtagaagtc 
aatatcattg ccagctacgg ctataccgat 
gggaaaccat tgccgaatgt tcctcgtcat 
cataacatgc caggcaataa cacactgacg 
cgttcggcaa ccaatggggc tgactattat 
gccgcataca aaatgaaatt gcagtatccg 
tttgataaaa cgtattacac ctcttccatc 
gatccgcgtg aagtgcaatt cacggtgaaa 
<212> Type : DNA 
<211> Length : 2142 

SequenceName : SEQ ID 458 
SequenceDescription : 

Sequence 



gaggccjaccg 
atgctggata 
gcaacgacgc 
ggcggaactc 
atgaccaacg 
gaagtcjctaa 
aacgtcgtga 
agttttggcg 
gcataccgcc 
cgcagtacat 
ctctattccc 
acgaaacagc 
gatggtcagt 
gcgqgctttg 
gcgtatgatg 
cagcgtatgc 
aatgagattc 
cgctgtaaaa 
tgcacaacgg 
tatgcscagg 
cagtattaca 
gatgaacaat 
tttgctaatt 
cttccaccgg 
ataaccgcag 
attggtgatg 
gaccttgcgg 
gctaacjgttc 
accggfctcgc 
tttggcggtg 
ctgccaggct 
gtcacactgc 
gccacraaata 
atggaatttt 



atggttatca 
tcccgcaggt 
tggatgaagc 
aggacgcctt 
gcctgcgaac 
aaggtccggc 
ccaagcgccc 
gaggcactgg 
tgacggggga 
ttattgcccc 
atcgggacta 
ccgtaaacgt 
ccgatctggc 
attacagcta 
caacgacagg 
attctactcg 
tgggtggggt 
acgctaaaga 
tttcggcgtc 
atgcgctcta 
cgcagtatgc 
ggacgcccaa 
attcgcaaac 
aatcatctaa 
atattgcgct 
aaaccatcgc 
gagcattaac 
tggaagatcc 
tattcctgac 
gtggacattg 
atttcgttgc 
aattaaacgt 
atctggggaa 
ga 



acctctgagc 

ggtcaatacg 

gctttataac 

tgtacgtcgt 

tgtacttcct 

ctccacgctg 

ggaaaaaaca 

gcaacttgat 

agtgcaggat 

gtcactcacc 

taaaactccg 

tgatcgaaaa 

gcaactcaac 

cagccaggat, 

aacgctgaca 

tgcggatctg 

gtcatatgaa 

tttcaatatc 

ggacagtgat 

tctgaccgat 

gggtaaaggc 

actggggtta 

atttatgccg 

tgcttacgaa 

gtttgatatc 

caaaacggca 

tgaaaacatt 

tgattatgca 

ctatgacatt 

cgtaagccgt 

cgatgccttc 

caaaaacctg 

tcagattggc 



180 
240 
300 
350 
420 
480 
540 
600 
660 
720 
780 
840 
900 
36.0. 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2142 



<213> OrganismName : Escherichia 
<400> PreSequenc eSt ring : 
atgcggactc tgcagggctg gttgttgccg 

45 caagaggcaa cggtgaaaga ggtgcatgac 
aatatgctgc aggagcatga caatccgttc 
atctacaccc aaaccagcga tctgaacaaa 
aatgcgcgta aggatgaagt aaagtttcag 
attttaggcc cgaactcggt gttgggtgcg 

50 tccaatagcg aagagtcttc accgtttcgt 
ggttttgcca ccgattaccg ttttgcaggt 
aaccacgact ctaacgggcg ttcagacccg 
cgcctgatgg cagaaaacgg taactggctg 
aatactgacg ataacccgga tatcaccaaa 

55 tatcacctcg gtgatgcggt gctcagtgcg 
ggcgcjcgcgg agttaggctt aagttacccg 
gtttacagcg gctatggcga atcgctcatc 
gtgggggtta tgctaaacga fcttgttttga 
<212> Type : DNA 

60 <211> Length : 870 

SequenceName : SEQ ID 459 
SequenceDescription : 



COli 0157:H7 



gtgtttatgt 
gcgccagcgg 
acgctctatc 
gaagcgattg 
ttgagcctgg 
tcttatacgc 
gaaaccaact 
tggaccjctgc 
acctcccgca 
gtagaagtga 
tatatgggtt 
aaaggacagt 
atcaccaaac 
gactataact 



tgcctatggc 


agtatatgca 


60 


tgcgtggcag 


tattatcgcc 


120 


cttatgacac 


caactacctc 


180 


ccagttacga 


ctgggcggaa 


240 


catttccgct 


gtggcgtggg 


300 


aaaaatcctg 


gtggcaactg 


360 


acgaaccgca 


attgttcctc 


420 


gcgatgtgga 


gatggggtat 


480 


gctggaaccg 


cctttatact 


540 


agccgtggta 


tgtggtgggt 


600 


actaccagct 


taaaatcggc 


660 


acaactggaa 


caccggctac 


720 


atgtgcgcct 


ttatactcag 


780 


tcaaccagac 


ccgtgtcggt 


840 






870 



65 



Sequence 

<213> OrganismName : Escherichia coli 0157:H7 
<4 00> PreSequenceString : 
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atggcggttc aaaagaatgt tatcaaaggc atactggcag gtacgtttgjc gctaatgctg 60 

agcggttgtg tcactgtgcc ggacgccatt aaaggcagca gtcccacgcc gcaacaagat 120 

ttagttcggg tgatgagtgc gccgcagctg tacgttggtc aggaggcacg ctttggtggc 180 

aaagtggttg cggtacaaaa ccagcaaggg aaaacccgcc tggaaattgc taccgtaccg 240 

5 ctggacagcg gagccagacc gacgctggga gaaccttctc gcggtcgcat ttatgccgat 3 00 

gtgaacggtt ttctggaccc ggtggatttc cgtggacaac tggttacggt agtcgggcca 3 60 

atcactggtg cggttgacgg aaaaatcggc aacacgccct ataaatttat ggtgatgcag 420 

gcaacaggtt acaaacgttg gcatttaacc cagcaggtga ttatgccgcc tcagccgatt 480 

gatccctggt tttatggcgg tcgtggctgg ccctatggcc acggcggatg gggctggtat 540 

10 aatcccggcc ccgcgagagt acaaacagtt gtaactgaat aa 5 82 



<212> Type : DNA 

<211> Length : 582 

SequenceName : SEQ ID 460 
SequenceDescription : 

15 

Sequence 



<213> OrganisraName : Escherichia coli 0157:H7 
<4 00> PreSequenceString : 

20 atgcgtaagc agtggctcgg gatctgcatc gcggcaggaa tgctcgcggjc atgtacaagc 60 

gatgatggtc agcaacagac agtaagtgta ccgcagcctg cggtatgtaa cggccccata 12 0 

gttgaaatta gcggggcgga cccgcgtttc gaaccactga acgcgacggc aaatcaggat 180 

taccagcgcg acggtaaaag ctacaaaatc gtgcaggatc cgtctcgatt tatccaggcg 240 

ggactggcgg caatctatga tgccgaacca ggcagtaacc tgacggccfcc tggcgaagct 3 00 

25 ttcgatccga cacagctgac ggcggcccat ccaacgcttc cgatccccag ctacgccaga 360 

atcactaacc tggctaacgg gcgaatgatc gtggtgcgca ttaatgatcg cggtccttac 420 

ggcaacgacc gcgttatttc gctttctcgc gcagcagctg accgtcttaa cacgtcaaac 480 

aacaccaaag ttcgtatcga tccgattatt gtcgcccagg atggttcgct ttctggtcct 540 

ggtatggctt gtaccacagt cgccaaacag acttacgccc tgcctgcacc tcccgattta 600 

30 agcggtggcg cgggaacaag ttcagtgtct ggcccgcagg gtgacattct tccggtcagt 660 

aattcgacgc taaaaagcga agatccgacc ggcgcgccgg taaccagcag cggtttcctc 72 0 

ggcgcaccaa cgaccttagc gccaggtgta ctggaaggca gcgaaccgac gcctgctcca 780 

cagcccgttg ttacagctcc gtcgacaacg cctgcaacct cgcctgcaat ggtgacaccg 840 

caagccgcct cgcaaagcgc cagcggcaac tttatggtgc aggtcggggc cgtaagcgat 900 

35 caggctcgtg cgcaacagta ccaacagcaa ctgggacaga agttcggcgt ccccggtcgc 960 

gtaactcaaa atggcgcggt ctggcggatc cagcttggcc cattcgccaa caaagccgaa 1020 

gccagtacct tgcagcaacg tttgcaaacc gaagcccaat tacagtcafet tattaccacc 108 0 

gcgcagtag 1089 
<212> Type : DNA 

40 <211> Length : 1089 

SequenceName : SEQ XD 461 
SequenceDescription : 



Sequence 
45 

<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

atgattaaac gcgtattggt tgtttcaatg gtaggtctgt ctcttgtcgg ttgtgttaat 60 
aacgacaccc tgtcagggga tgtttatacc gcttctgaag cgaaacaagt acagaatgtc 120 

50 agctatggaa ccatcgttaa cgtacgtccg gtacagattc agggcggtcfa tgattccaac 180 
gttatcggtg caattggcgg tgctgttctt ggtggtttcc tgggaaatac tgttggtggc 240 
ggaaccgggc gttctctggc tactgcagca ggcgctgttg caggtggcgt agctggtcag 3 00 

ggcgtacaga gtgcaatgaa caaaacgcaa ggtgtcgagc tggaaattcg taaagacgat 360 
ggtaatacca tcatggtggt acagaaacaa ggcaacactc gtttctctcc gggccaacgt 420 

55 gtcgtactgg ccagcaatgg cagtcaggtg accgtttctc cgcgctaa 468 
<212> Type : DNA 
<211> Length : 468 

SequenceName : SEQ ID 462 
SequenceDescription : 

60 

Sequence 



<213> OrganisraName : Escherichia coli 0157:H7 
<400> PreSequenceString : 
65 gtgtcaaagg caaccgaaca aaacgacaag ctcaagcggg cgataattat ttcagcagtg 60 
ctgcatgtca tcttatttgc ggcgctgatc tggagttcgt tcgatgagaa tatagaagct 120 
tcagctggag gcggcggtgg ttcgtccatc gacgctgtca tggttgafctc aggtgcggta 180 
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gttgagcagt acaaacgcat gcaaagccag 
cgcaagatga aggaacagca ggctgctgaa 
gaacgcctga agcaacttga gaaagagcgg 
gaagaagccg caaaacaggc cgagttaaag 
5 gcggcggcag atgctaaagc gaaggccgaa 
aagaaagcgg ctgcagacgc gaagaaaaaa 
gaagcgcaga aaaaagccga ggcagcagct 
gaagcagctg cagctgaagc aagaaagaaa 
aaagcagcag aaaaagcggc tgctgaaaag 

10 aaagccgccg cagacaaaaa agcggcagct 
gcaaaagctg ccgcggaggc agatgatatt 
ccgaaaacgg ggggaggggc gaaagggaac 
aaaaacaatg gcgcatcagg ggccgatatc 
atcgaaagta- agttctatga cgcatcgtcc 

15 aaactggcac ccgatggtat gttactggat 
tgtcaggctg cgttggcagc agctaaactt 
gtatatgaag tgttcaaaaa cgcgccattg 
<212> Type : DNA 
<211> Length : 1185 

20 SequenceName : SEQ ID 463 

SequenceDescription : 

Sequence 



25 <213> OrganismName : Escherichia 
<4 00> PreSequenceString : 
gtgatgaaat ttaaaaaatg tcttctgcct 
ggatgccagt caaatgctga cgatcatgcc 
accaaacaag aaactaaaac cgttaatatt 

30 gacaactccc aaaataaacg gaacgcacaa 
ggcggtgtta tcggccacaa cgtcgggtct 
gttggcggcg gagctgtagg cgcggcagcg 
gaaggtgttt ctfctaacgta taaggaaggc 
aaagagtgcc agtttacgac aggtttagcc 

35 cgtattcagc caaataccaa afcgtcctgaa 
<212> Type : DNA 
<211> Length : 519 

SequenceName : SEQ ID 464 
SequenceDescription : 

40 

Sequence 



<213> OrganisraName : Escherichia 
<4 00> PreSequenceString : 

45 atgttgctta gcataatcac tgtcgcgttt 
gcctcgctgg cgcatctggc gcaggcggaa 
ggcggttcca atgacggcac tcgtgagtac 
cgctttgtca gcgagccaga taacggtatc 
gcacaaggca agttcgcgtt gtttctcaac 

50 tattttgtcc gtaagttaaa aatgcaaaaa 
ctggattttg gcgacgggca taaaattaaa 
cacagcctgc ccgccagtca tcaggcgata 
cgttatgacc tggaatataa agtttcttct 
gcaggttatg catttaaaaa actcaatggc 

55 tctaccacca ataatatgga attgtgtgct 
catgtgcctg gcttttgggc tgaattatcc 
acgaaagcct tatataacaa aagctga 
<212> Type : DNA 
<211> Length : 747 

60 SequenceName : SEQ ID 465 

SequenceDescription : 

» Sequence 



65 <213> OrganisraName : Haemophilus 
<40 0> PreSequenceString : 
atgaaattaa caacactgca aaccttgaaa 



gaatcaagcg cgaagcgttc tgatgagcag 240 

gaactgcgtg agaaacaagc ggctgaacag 300 

ttagctgctc aggaacagaa aaagcaggct 3 60 

cagaagcaag cggaagaggc ggcagcgaaa 42 0 

gcggatgata aagctgcgga agaagcagcg 48 0 

gcagaagcag aagccgccaa agccgcagcc 540 

gcggcgctga agaagaaagc ggaagcggca 6 00 

gcggcagcag agaaagctgc agccgacaaa 660 

gcagcagctg ataagaaagc agcggcagaa 72 0 

gcaaaagcag cagctgaaaa agccgctgca 780 

ttcggtgagc taagctctgg taagaatgca 840 

aatgcttcgc ctgccgggag tggtaatact 900 

aataactatg ccgggcagat taaatctgct 960 

tatgcaggca aaacctgtac gctgcgcata 1020 

atcaaacctg aaggtggcga tcccgcactt 1080 

gcgaagatcc cgaaaccacc aacgccaggca 1140 

gacttcaaac cgtaa 1185 



coli 0157 :H7 

gtggcaatgt tagcgtcatt cactctggca 60 

gccgatgttt atcaaaccga tcaactgaat 120 

atttccattc ttcccgcaaa agttgccgta 180 

gccttcggcg cgcttattgg cgcagtcgct 240 

ggcagcaatt ccggaacgac ggcaggtgca 3 00 

ggttctatgg tgaatgataa aaccttagtg 360 

accaaagtgt atacctctac tcaggtgggt 420 

gttgttatta ccacgacgta taacgaaacg 480 

aagagctaa 519 



COli 0157:H7 

cgtaacctcg aagggatagt caaaacacat 60 

gatatcagct tcgaatggat tgttgtcgat 120 

ctggaaaatc tcaatggtat ctataaccta 180 

tacgacgcca tgaacaaggg cattgcgatg 240 

tcgggcgata tttttcatca ggatgccgca 3 00 

gataacgtga tgatcaccgg cgatgcgctg 360 

cgtagcgcca aaccgggctg gta.tatttat 420 

tttttcccgg tatccggttt gaaaaaatgg 480 

gattacgcgc tggcagccaa aatgtataaa 540 

ctggtgtctg aattttctat gggtggggta 600 

gacgcgaaaa aagtccaacg gcaaatatta 660 

tggcatttac gccagcgtac tacctcaaag 72 0 

747 



influenzae Rd 

aaagggttta cattaatcga gctaatgatt 60 
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10 



gtgattgcaa ttattgctat tttagccacc atcgcaattc cttcttafcca aaattatacc 120 

aaaaaagctg cggtatccga attactgcaa gcgtctgcgc cttataaggc tgatgtggaa 180 

ttatgtgtat atagcacaaa tgaaacaaca agctgtacag ggggaaaaaa tggtattgca 240 

gcggatataa agacagcaaa aggctatgta gcctcagtta tcactcaatc aggtggtatt 3 00 

acagtaaaag ggaatggcac attggcaaat atggaatata ttttgcaagc taaaggtaat 360 

gctgcagcag gtgtaacttg gacaacaacc tgcaaaggaa cggatgcctc tttatttcca 42 0 

gcaaattttt gcggaagtgt cacaaaatga 450 
<212> Type : DNA 
<211> Length : 450 

SequenceName : SEQ ID 466 

SequenceDescription : 



Sequence 



15 <213> OrganismName : Haemophilus 
<400> PreSequenceString : 
atgctaaata aaaaattcaa actcaatttt 
ccttatacag aagctgcgtt agtgagagac 
gcagaaaata aagggagatt ttctgttggt 

20 aaccactctt taggcaatgt tttacctaat 
gatgtagata aacgcatcgc cacattgata 
gttagtaacg gcgtgagtga actacatttt 
aatgctaaat cgcaccgaga tgtatcttca 
aatgagtatc caactaaatt gaatggaaaa 

25 cgccgtgaag actactatat gccacgtctt 
gaggcttcaa ctgcaagtag tgatgctggc 
tttgtaagac taggaagtgg tagtcaattt 
attttaaata atcatgaggt tggaggcaat 
tatggtattg caggcacacc ttataaagta 

30 ggcaattcaa aagaggaaca cagcgatcca 
aattatgctg ttttaggcga cagtggctcc 
aaatggcttt ttcttgggtc ttatgatttt 
gaatggaata tttataaacc tgaatttgca 
tctttaactg gttctaacac ccaatacaat 

35 tctaatggtt ctgaatctct aaatgttgat 
aagaagaaca atcacggaaa aagtgtgact 
aataatatcg atcaaggcgc aggcggcttg 
acttctgata gtaccacttg gaaaggagct 
acgtggaaag tacataaccc gaaatctgat 

40 attgtagaag gaaagggaga aaataaaggt 
ttaaaacaac aagctgatgc caataataaa 
agtggtcgct caactgttgt acttaatgat 
tttggcttta gaggtggtcg attagatgcc 
cgtaatattg atgatggcgc aagactagta 

45 acaattactg gggaaagtct aattacagat 
gcaccagatg aagataatcc ttatgccttt 
ttaaatttgg aaaattacac ttattatgcg 
ttacctaaaa atagtggcga aagcaatgaa 
gaagccaaaa gaaatgtaat gaaccatatc 

50 tattttggcg aggaagaggg taaaaataac 
agtgagcaaa atcgcttttt attaacaggc 
gaaaaaggca cattattcct ttctggcaga 
atttcttcga caaaaaaaga tcaacacttt 
gactggatta accgcaattt taaagcaaca 

55 tattcaggtc gcaatgttgc aaacattact 
gtacatattg gctataaagc aggcgatacc 
gtgacttgca ctactgacaa gttatccgat 
gtatctggca atgtaaattt atcaggtaat 
ttcggcacaa ttagcggcac gggaaatagc 

60 catttaacag gcgatagcaa tgttaatcag 
aatgcacaaa acgatgcaaa taaagtaact 
tcaggtaacg gttctttcta ttatttaact 
gttgtaacta aatccgccac aggtaacttt 
cctacaaaaa atgaactcac gctttttgat 

65 gtgtcattag ttgggaatac cgttgattta 
aatggacgtt acgatttgta taacccagag 
acaaatatca caacacctaa taatattcaa 



influenzae Rd 



attgcgctta 
gatgtggatt 
gcaacaaatg 
ggcattccga 
aatccacaat 
gggaacttaa 
gaagaaaata 
gcagtaacta 
gataaatttg 
acatataatg 
atttataaaa 
aatcttaaat 
aaccacgaaa 
aaaggaatat 
ccattatttg 
tgggcaggtt 
aaaactgttc 
tggaatccta 
ttattcgata 
cttagaggaa 
ttctttgaag 
ggcgtttctg 
cgtttagcta 
tcgctaaaag 
gttaaagcct 
gataagcaag 
aatggcaata 
aatcacaata 
ccaaatacaa 
cgacggatta 
ttaagaaaag 
aattggctat 
aacaacgagc 
ggtaatctaa 
ggaacaaacc 
ccaacaccgc 
gctgaaaata 
aatattaatg 
tcaaatatca 
gtttgtgtac 
aaagccctta 
gcaaactttg 
caagtacgtt 
ttaaatttag 
acatataaca 
gatctttcca 
acattacaag 
gcgtcaaatg 
ggtgcttgga 
gtggaaaaaa 
gctgatgtgc 



ctgtcgccta 


cgcattaacc 


60 


atcaaatatt 


tcgtgatttt 


120 


tggaagtcyag 


agataaaaat 


180 


tgattgattt 


tagtgttgtg 


240 


atgtagtagg 


tgtaaaacac 


300 


atggcaafcat 


gaataatggc 


3 60 


gatatttfctc 


cgttgagaaa 


420 


ctgaagatca 


aactcaaaaa 


480 


ttaccgaagt 


tgcaccaata 


540 


atcagaataa 


atatcctgct 


600 


aaggagat aa 


ttacagctta 


660 


tggtgggcga 


tgcctatacc 


720 


ataatggact 


aattggtttt 


780 


tatctcaaga 


tccgcttacc 


840 


tatatgat ag 


agaaaaagga 


900 


ataacaaaaa 


atcttggcaa 


960 


tagataaaga 


tactgcaggt 


1020 


ctggcaaaac 


aagcgt t at t 


1080 


gt agt c agga 


tactgactct 


1140 


gtggaacgct 


taccttaaat 


1200 


gagattafcga 


agttaaaggc 


1260 


ttgctgatgg 


aaaaacagta 


1320 


aaatcggcaa 


aggaacatta 


13 80 


tgggcgatgg 


taccgtcatc 


1440 


tttcacaagt 


aggtatagta 


1500 


tagatccaaa 


ttccatttac 


1560 


=j (^-» -1— i—i =a • 1 — 4-4- 




162 0 


ccagcaaaac 


ctctactgta 


1680 


ttactccata 


taatatagac 


1740 


aagatggagg 


acagctctat 


1800 


gtgcgagcac 


tcgttcagaa 


1860 


atatgggtaa 


aacttccgat 


1920 


gtatgaatgg 


ctttaacggt 


1980 


atgtgacfctt 


taaaggcaaa 


2040 


ttaatggcga 


tttaaaggtt 


2100 


acgcaagaga 


tattgcaggt 


2160 


atgaagtggt 


agtagaagat 


2220 


taaccaataa 


cgcaaccctt 


2280 


cagcttcfcga 


taatgcaaaa 


2340 


gttctgacta 


tacgggctat 


2400 


atagctttaa 


cgccaccaat 


2460 


tcttaggcaa 


agctaactta 


2520 


taaccgaaaa 


tagccattgg 


2580 


acaaggggca 


tattcattta 


2640 


cgctgactgt 


gaatagctta 


2700 


ataaacaagg 


cgacaaagtt 


2760 


tggcagafcaa 


aacaggcgag 


2820 


ctacaagaaa 


taatttgaat 


2880 


aatataaatt 


acgtaatgtt 


2940 


gaaatcaaac 


tgtcgatacg 


3000 


ctagcgtacc 


aagtaacaat 


3060 
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gaagaaatag cccgtgttga aacaccagtt ccaccacctg cgcctgctac accatcagag 3120 

acaactgaaa cagtggctga aaatagtaag caagaaagta aaacagtaga gaaaaacgag 3180 

caagacgcaa ccgagacaac agctcaaaat ggagaagttg cagaagaagc taaaccaagt 3240 

gtaaaagcta atactcaaac aaatgaagtg gctcaaagtg gaagtgaaac cgaggaaact 33 00 

5 caaacgactg aaataaaaga aacagctaaa gtagaaaaag aggaaaaggc taaagtagaa 3350 

aaagatgaaa ttcaagaagc acctcaaatg gcttctgaaa cgtctccgaa acaagcaaag 3420 

cctgctccta aagaagtttc aactgatacg aaagtagaag aaactcaagt tcaagctcaa 3480 

ccgcaaacac aatcgacaac tgttgctgcg gcagaggcaa cttcgccaaa cagtaaacca 3540 

gcggaagaaa ctcaaccaag tgaaaaaact aacgctgaac ctgtaacgcc tgtagtatca 3 600 

10 aaaaatcaaa cagaaaatac gaccgaccaa ccaacagaaa gagagaaaac ggctaaagta 3 660 

gaaacagaga aaactcaaga accccctcaa gtggcttctc aagcgtctcc gaaacaggaa 3720 

cagtctgaaa ctgttcaacc gcaagcagtg cttgaaagtg aaaatgttcc gactgttaat 3780 

aatgcagaag aagttcaagc tcaactgcaa acacaaacaa gtgcaacagt aagcactaaa 3 840 

caacctgcac cagagaattc aataaatact ggatctgcaa ccgcaataac agaaactgct* 3 900 

15 gaaaaatccg ataaaccaca aacggaaact gcggcttcga ctgaagatgc tagtcagcat 3 960 

aaagcgaata ctgttgcgga taattctgta gcaaataatt cagaaagcag tgatccaaag 4020 

agtagacgta gaagaagtat tagccagcct caagagactt ctgctgaaga aacaacagca 4080 

gcttctactg acgaaacaac aatagctgat aattcaaaac gcagtaagcc aaatcgtaga 4140 

agtagaagaa gtgttcgctc ggaaccaact gttacaaatg gcagcgatcg ttctacagta 4200 

20 gcattgcgcg atctcacaag tacaaacaca aatgcggtaa tttctgatgc aatggcaaaa 4260 

gcacaatttg ttgcattaaa tgtggggaaa gcagtttctc aacatattag ccagttagaa 432 0 

atgaataacg aggggcaata taacgtttgg gtatctaata cttcaatgaa cgaaaattat 4380 

tcctcaagtc aatatcgtcg ttttagttct aaaagtacgc aaactcaact tggttgggat 4440 

caaacaatct caaacaatgt tcagttaggt ggcgtgttta cttatgttcg caatagtaac 4500 

25 aactttgata aggcaagcag taaaaatact ctagcacaag ttaatttcta ttctaaatat 4560 

tatgcggata atcattggta tttgggcatt gatttaggct acggcaagtt ccaaagcaac 4620 

ctaaaaacca atcataatgc gaaatttgct cgccatactg cacaafcttgg tttaaccgca 4680 

ggcaaagcat ttaatcttgg caattttggt attacgccaa tagtaggcgt gcgttatagc 4740 

tatttatcaa acgctaattt tgcattagct aaagatcgca ttaaagtaaa tccaatatct 48 00 

30 gtcaaaacag cctttgctca agttgattta agttatactt atcacttagg cgagttttcc 4860 

gttacgccaa ttttgtctgc tcgatatgat acaaatcaag gcagcggaaa aattaatgta 4920 

aatcaatatg attttgctta caacgtggaa aaccaacagc aatataacgc agggcttaaa 4980 

ttgaaatatc ataatgtgaa attaagtcta ataggcggat taacaaaagc gaaacaagcg 5040 

gaaaaacaaa aaactgcaga attaaaacta agttttagtt tttaa 5085 

35 <212> Type : DNA 

<211> Length : 5085 

SequenceName : SEQ ID 467 

SequenceDescription : " 
40 Sequence 



<213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

atggcattag taaacaaaat taaaacatta tcatcagtag gtattctagc ggctacatta 60 

45 tttcttgcag gctgccaagc acaatcaaat atattagcat ttacaccgcc tgcaccaagt 120 

gcttcaatga atgttaatcg aactgccgtt gtatctgtga caacaaaaga tagccgtgca 180 

atacaagaga ttgcgagtta tacgaaacac ggggaactga ttaaattaaa tgcatcccca 240 

agtgttacac aattatttca gcaagtgatg cagcaaaatt taattagtaa aggttttaga 300 

gttgggcaat taaatggttc aaatgcgtgg gtaactgtgg atgtgcgtga atttggtacg 3 60 

50 caagtagaac aaggtaatct tcgttataaa cttaatacca aaattcaagc gacagtttat 420 

gtacaaggtg cgaaaggttc gtataataaa tcatttaatg tcacgcactc acaagagggc 480 

gtatttaatg cgggcaatga tgaaattcat aaagtgctat ctcaaacttt taatgatatt 540 

gtgaacaata tttatcaaga tcaagaagtt gcggctgcga taaaccaata ttctaattaa 600 



55 <212> Type : DNA 

<211> Length : 60 0 

SequenceName : SEQ ID 468 
SequenceDescription : 

60 Sequence 



<213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

atgctgtgct ggattggcta caaaaatggg attttaccgc agcagaacag cacgctatat 60 

65 ccttggctga atccgtcaaa gtgcggcgta atttttgatg gttttcaact tgtgggcgat 12 0 

gatttcaatt cagatcaaac ggctgaaaat acatcgccag cttggcaagt gctttacaca 180 

acccatttac aaagttgctc gccgattcat agtggagaaa atttcgcacc catccctttg 240 
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tataaacaac taaaaaatca accgcaccta agccaagatc tgattaaatg gcaagaaaat 3 00 

tggcaagcct gcgatcaact acaaatgaat ggtgcggtat tagaacaaca atctttggca 3 60 

gaaatttccg atcatcaaag tacgctttca aaacacggac gatatttagc tcaagaaata 420 
gaaaaagaaa ctggcatacc gacttactat tatttatatc gtgtaggtgg gcaatcttta 480 
5 gaatctgaaa aatcccgttg ctgcccttct tgtggtgcaa attgggcgtt aaaagacgcg 540 
atttttgata cctttcattt taaatgcgat acctgtcgat tagtttcgaa tctatcgtgg 600 
aattttttgt aa 612 
<212> Type : DNA 
<211> Length : 612 
10 SequenceName : SEQ ID 469 

SequenceDescription : 

Sequence 



15 <213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

ttgggtgcat ttgcctttgc ttctgttaca aatgcgaata tttatgctga gggcgatatc 60 
ggtttatctc aaactaaagc aaacggtagt aacaatacaa gagttggacc tcgcgtatcc 120 
gtgggttata aagtaggaaa tacacgtgtt gcgggtgatt atactcatca tggaaaagtt 180 

20 gatggcacaa aaattcaagg tttaggtgca tcagtattat atgattttga cacgaattct 240 
aaagtgcaac cttatgttgg tgctcgtgta gcgactaatc aatttaaata caccaatcgc 3 00 

gcagaacaaa agtttaaaag ttcttctgat attaagctcg gatatggggt tgtagcaggt 3 60 

gcaaaatata agttagatgg caactggtac gcaaatggtg gagttgagta caafccgttta 420 
ggtaattttg atagtaccaa agttaataac tatggtgcaa aagttggtgt ggggtacgga ' 480 

25 ttctaa 486 
<212> Type : DNA 
<211> Length : 486 

SequenceName : SEQ ID 470 
SequenceDe script ion : 

30 

Sequence 



<213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

35 atgaaaaaac ttctaatcgc aagtttatta ttcggtacga caacgactgt gtttgccgca 60 

ccttttgtgg caaaagatat tcgtgtggat ggtgttcaag gtgacttaga acaacaaatc 120 

cgagcaagtt tacctgttcg tgctggtcag cgtgtgactg acaatgatgt ggctaatatt 180 

gtccgctctt tattcgtaag tggtcgattc gatgatgtga aagcgcatca agaaggcgat 240 

gtgcttgfctg ttagcgttgt ggctaaatcg atcatttcag atgttaaaat caaaggtaac 300 

40 tctattattc ccactgaagc actaaaacaa aacttagatg ctaacgggtt taaagttggc 360 

gatgttttaa ttcgagaaaa attaaatgaa tttgccaaaa gtgtaaaaga gcactatgca 420 

agtgtaggtc gctataacgc aaccgttgaa cctattgtca atacgctacc aaataatcgc 480 

gctgaaattt taattcaaat caatgaagat gataaagcaa aattggcatc attaactttc 540 

aaggggaacg aatctgttag tagcagtaca ttacaagaac aaatggaatt acaacctgat 600 

45 tcttggtgga aattatgggg aaataaattt gaaggtgcgc aattcgagaa agatttgcag 660 

tcaattcgtg attattattt aaataatggc tatgccaaag cacaaatcac taaaacggat 720 

gttcagctaa atgatgaaaa aacaaaagtt aatgtaacca ttgatgtaaa tgaaggttta 780 

cagtatgacc ttcgtagtgc acgcattata ggtaatctgg gaggtatgtc tgccgagctt 840 

gaacctttac tgtcagcatt acatttaaat gatactttcc gccgtagtga cattgcagat 900 

50 gtagaaaatg caattaaagc aaaacttgga gaacgcggtt acggtagcgc aacggtaaat 960 

tcagtacctg attttgatga tgcaaataaa acattagcga taacccttgt tgttgatgct 1020 

ggacgacgtt taactgttcg ccaacttcgc tttgaaggaa ataccgtttc tgctgatagc 1080 

actttacgtc aggaaatgcg ccaacaagaa ggaacttggt ataattcaca attagttgag 1140 

ttaggaaaaa ttcgcttaga tcgtacaggt ttcttcgaaa cagtcgaaaa ccgaattgat 1200 

55 cctatcaatg gtagtaatga tgaagtggat gtcgtatata aagtcaaaga acgtaacacg 1260 

ggtagtatca actttggtat tggttacggt acagagagtg gtattagtta tcaagcaagt 132 0 

gttaaacaag ataatttctt gggaacaggg gcggcagtaa gtatagctgg tacgaaaaat 1380 

gattatggta cgagtgtcaa tttgggttat accgagccct attttactaa agatggtgta 144 0 

agtcttggtg gaaatgtttt ctttgaaaac tacgataact ctaaaagtga tacatcctct 150 0 

60 aactataagc gtacgactta tggaagtaat gttactttag gtttccctgt aaatgaaaat 1560 

aactcctatt atgtaggatt aggccatacc tataataaaa ttagtaactt tgctctagaa 1620 

tataaccgta atttatatat tcaatcaatg aaatttaaag gtaatggcat taaaacaaat 168 0 

gactttgatt tttcttttgg ttggaactat aacagcctta atagaggcta tttcccaact 174 0 

aaaggggtta aagcaagtct tggtggacga gttactattc caggttctga taacaaatac 180 0 

65 tacaaactaa gtgcagacgt acagggtttc tacccattag acagagatca cctctgggtt 1860 

gtatctgcaa aagcatctgc aggatatgca aatggttttg gaaacaagcg tttaccgttc 1920 

tatcaaactt atacagcggg tggcatcggt tcattacgtg gttttgctta tggtagtatt 198 0 
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ggacctaacg caatttatgc agaacatggt 
tctgatgtga ttggtggtaa tgcaatcaca 
ccgtttgtga gtgataaaag ccaaaataca 
agtgtttgga atactaagtg gaaatcggat 
agcttacccg attatggcaa atcaagccgt 
tggcaatctc ctattgggcc attggtattc 
aatgatgatg tcgaacagtt ccaatttagt 
<212> Type <" : DNA 
<211> Length : 2388 

SequenceMame : SEQ ID 471 
SequenceDe script ion : 

Sequence 



aatggtaatg 
actgcgagtg 
gtccgaacct 
aagagtgggt 
attcgcgcct 
tcttatgcca 
attggaggtt 



gtacttttaa 
cagaacttat 
ccctatttgt 
tagataacaa 
ctacaggtgt 
aaccaattaa 
ctttctaa 



gaagataagt 
tgtaccaaca 
tgatgcggca 
tgtattaaaa 
cggattccaa 
aaaatatgaa 



<213> OrganismName : Haemophilus 
<400> PreSequenceString : 
atgttgaaaa aaacatctct tatttttacc 
gcgaatgtaa caacacctca agcgcaaaaa 
caaaaaggcg aagctgatcg atatttatgt 
gccacgcata aaaaatacaa aaaaaatttg 
tcagaaaaac taaccttaat gatttctgaa 
atgtggcaag agcgtgatga ttttagtacg 
acgcaatgtg tctcacaaac aagtgaacgc 
<212> Type : DNA 
<211> Length : 405 

SequenceName : SEQ ID 472 
SequenceDescription : 

Sequence 



influenzae Rd 



gcacttttaa 
atgcaagtag 
caagatgata 
cattatgtta 
cgtggtaaaa 
ctaaaaacga 
ttatctggac 



tgactggctg 
aaaaagtgga 
gagttgttcg 
ctgtcacttt 
attacgccaa 
atctcggcga 
aataa 



tgtgcaaaat 
taaagcctta 
tgttgtacac 
tcaaggcgta 
tattcgttgg 
aattttagca 



<213> OrganismName : Haemophilus 
<400> PreSequenceString : 
ttgagaatta ttattatttt ttttatggga 
gcttgcctat tccgttatgc ttgggctaac 
caaccaacca accaaccaac caaccaacca 
caaccaacca accaaaatag taatgtttct 
tctaccgaaa atagtgatac aaaaacacca 
aaaacgctgg aaagagaaca agcaaacaac 
gttactgttg ttgaagctgg acgttttggg 
gaaaatcgtg tagcgattaa tattgatgga 
ggctttaaag agctttttga aggttatggt 
attgaaacct taaaagaagt aaatatcaca 
ggttccttag gtggttctgt aatttataaa 
aaggattact atgtaagtta caaaaaggga 
acccttactc ttgcaggacg ttataaaaag 
aatgggcatg aacttgagaa ctatggttat 
aaaagagaaa aagcagatcc atataaaatt 
ttcaatccta cagaaaatca tcgttttacc 
cgtgggcaag atttatccta tacattaaaa 
gttgattcta gacacaccaa tgataaaaca 
aatttctctc aaactccatt ttgggatacg 
aaaactcgtg cacgcacaga tgagtattgt 
gacaatccta cgggactaaa agtaacaaat 
cttcaatttg aggaaaaaaa caatacagct 
aaatttattg atactgataa gagagtaata 
gacacttggt atgattgttc aatatttaat 
aaaggtaaca attattatgg ctatgatgga 
gaattaaatg gcaaaaaatt cgctaaaata 
cttccctctt cacctggtta tttagaacgc 
acccaacaat taaatttaga tttaaccaaa 
ctacaatatg gtggatcata taataccgcg 
gatgcttctg atgtgcaatg gtgggcaaca 
aaacctcaca cttgtgcaac gacttatgag 
cctgaatttt cttacttatt acccattaaa 
aattttgtta taactgatta tttatctttt 
tatcaaccaa aatataaaca cggtatcaca 
ttatttattc cattaccaaa caattcaaat 



influenzae Rd 



ttaaatatga 
ggcaggtgtt 
accaaccaac 
gaacaactag 
ccaaaaattg 
attaaagaca 
caaagcggtt 
ttacgtcaag 
aatttcaata 
aaaggggctg 
acaaaagatg 
tacgctacgg 
ttcgatgtgc 
aaaaattata 
gaacaagata 
tttgccgctg 
tatcaacgta 
aagagacgta 
ctaaaactca 
gatgctggtg 
gggaaaataa 
aagagtagtg 
gacgataaac 
tgtgaaaata 
aaatggaaag 
aaggataatg 
ctctggcaag 
gacttcaaaa 
atgaagcgca 
cctacgcttg 
tggaatgcta 
acaacaggaa 
gatttgggtt 
cccaaattac 
tcagatccta 



ccaattttag attagaacgt 
gcttatgcag ctcaaccaac 
caaccaacca accaaccaac 



aacaaataaa 
ctgaaacggt 
tcgttaaata 
ttgccattcg 
ctgaaacctt 
acacgcgtaa 
attctattaa 
cgagagatta 
aaaataatca 
tagtggttac 
acgataaaat 
gtacattatt 
atttatatga 
gtggtaatga 
atatttcatt 
cttattctga 
taagacattg 
cacgtcgaga 
ataaaaccta 
tagtcctaaa 
acgcaaaaat 
aagttgacct 
ataggaaaat 
agagagattt 
tttggcatat 
tggttaatcg 
gtgaggattc 
acctttgtcc 
aatcagtcta 
atcgttatga 
ccgatgatat 
ataaagttaa 



tgtatctggt 
aaaaacagct 
cgaaacgggc 
tggtgtagat 
atcttctcaa 
tggtgcagaa 
gaatggtagt 
tctcataaac 
atcattcgat 
aacaagcaga 
tcaaggtaaa 
aaaattatct 
acatcgttct 
aacccctgaa 
tagttatgaa 
tcaacgtatt 
tgaaggcaca 
tggttcagac 
tgacttcaag 
caacccctct 
aaaagttttt 
tgaaataaaa 
aaaatctatt 
ggacaccaac 
tgaacataat 
tgctggcaac 
ttggactgga 
tcgagttgat 
tctctttgat 
caatatccat 
tgtgaaagga 
ggaaaatgta 



2040 
2100 
2160 
2220 
2280 
2340 
2388 



60 
120 
180 
240 
300 
360 
405 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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caacaaaata ttgactatat cgccaaacag aacaaaaaat ataaagcaca ttcttatagt 2160 

tttgtttcaa ccattgatcc aacgagtttt cttcgtttac aactaaaata ttctaaaggt 222 0 

tttagaacac caacttcaga tgaaatgtat ttcaccttta aacaccctga tttcactatt 2280 

ttgccaaata ctgatttaaa accagaaata gcaaaaacca aagaaattgc tttcacatta 2340 

5 cataatgatg attggggatt tatctcgaca agtctgttta aaactaacta taaaaacttt 2400 

attgacctaa tatttaaaaa gcaagaaact tttaaagtag gcggctctgg aagaggtgaa 2460 

acattaccat tttctcttta tcaaaatata aatagagata atgcgtcttt aaaaggtatt 2520 

gaaattaatt caaaagtatt ccttggtaaa atggcaaaat ttatggatgg atttaaccta 258 0 

agctataaat atacctatca aaaaggcaga atgaatggca atattcctat gaatgcaatt 2640 

10 cagcctagaa ctatggtata tggtttagga tatgatcatc caaatcataa atttggtttc 2700 

gatttctata cgacacatgt agcaagtaaa aatccagaag atacttataa tatgttctat 2760 

aaagaagaaa ataaaaaaga cagcacaatt aaatggagaa gcaaatctta tactattcta 2 820 

gatttaattg gatatgtaca accaattaaa aatttaacca taagagctgg tgtatataat 28 80 

cttacaaacc gtaaatacat tacttgggat tctgctcgtt caattcgttc atttggaaca 2940. 

15 agtaatgtta tagatcaatc aacaggatta ggcattaacc gcttctacgc accaggtaga 30 00 

aattataaaa tgtcagttca gtttgaattt taa 3033 
<212> Type : DNA 
<211> Length : 3033 

SequenceName : SEQ ID 473 

20 SequenceDescription : 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 

25 <4 00> PreSequenceString : 

atgacttata gaaatggcaa aatagattta aaggaacgct ttagtaaaaa ccgctctttt 60 

aagggcatta aaaagaaaat cgctaaaaaa tatacaatca aaaactcgct ttctataatt 120 

tattccttaa aaacgcattc aaattcttct ctatccatta ataaaaaaat cttcttaggg 180 

ctagggttcg tttcggcttt gagcgctcaa agtgaagatt ataatagttc ggtgtattgg 240 

30 ctcaatagcg tgaatgaaaa caataataac aaatcctact atattagccc cttacgcact 300 

tgggctgggg ggaataggag ttttacgcaa aattataaca atagtcaatt atacataggg 3 60 

acaaaaaacg cttccgcaac gcccaatcat tcttctgtgt ggtttggaga aaagggctat 420 

atcggtttta ttacaggggt ttttaaggct agagacattt ttatcacagg agctgttgga 48 0 

tcgggtaatg agtfcaaaaac cggtgggggg gcgatactgg tttttgaaag ctcaaacgaa 540 

35 ctaaccacta acggggctta ttttcaaaat aacagagccg ggacacaaac ttcttggatc 60 0 

aatttgattt ccaataacag cgtgaatttg acaaacacgg attttggcaa t'caaacccct 660 

aatgggggct ttaatgttat ggggcgaaag attacttata atggtgggag cgtcaatggt 720 
gggaattttg gctttgataa cgtggatagc aatggcgcaa ccaccattag cggggtaact * 780 

ttcaacaata acggtgcgct cacttataag ggtgggaatg gtattggagg gagcatcact 840 

40 ttcactaact ctaatatcaa tcattacaag ctcaatctta acgctaatag cgttaccttt 900 

aataacagca ctctagggag tatgcctaat ggcaacgcta acactatagg gaatgcctac 960 

attcttaatg caaataatat tacttttaat aatttgacct ttaatggggg ttggttcgtt 1020 

tttaatagat ctgatgctca tgttaatttt caaggcacaa ctacgatcaa taaccccact 108 0 

tcaccctttg tcaatatgac cggtaaagtt accattaatc ctaatgcgat ttttaatatt 1140 

45 caaaattaca cgcccacgat agggaacgct tacacgctct ttagcatgaa aaatggcaat 1200 

atcgcttatg atgatgtgaa taatttatgg aatattatca ggcttaaaaa cacgcaagcc 1260 

acaaaagaca atagcaaaaa cgccacttcc aataacaaca cccacactta ctatgtaact 1320 

tacaatttag gcggcacgct ctatcatttc aggcaaattt ttagccctga ttccattgtt 1380 

ttacaatccg tctattatgg cgcgaataat ctttactaca ccaatagcgt gaatatccat 1440 

50 gacaatgtct ttaatttaaa aaatattaac gatgataggg ctgatacgat tttttatctt 1500 

aatggcttga acacttggaa ttacacgcaa gcgagattcg ctcaaaccta tggcgggaaa 1560 

aacagcgctt tagtctttaa cgccacgact ccttgggcta atggtgcgat ccctaaatct 1620 

aacagcacgg tgcgttttgg ggggtatgag ggagtcaatt gggggaaaac gggctatatc 1680 

accggcactt tcacagccga tagggtttat atcaccggta acatgatgtc tggcaatggc 1740 

55 gctcaaaccg gtgggggggc gactttgaat tttgtgggcg cgactgaaat taatatcgct 1800 

ggagccactt ttaaaaactt aaaaaccact tcacaaaact cttacatgac ttttatggcg 1860 

ttagggaatg gctctgggag tggtaagatc aatgtttctc agtctgattt ttacgattgg 1920 

acggatggag ggtatgattt taccggtaat ggcgtttttg acagcgtgaa tttcaacaag 1980 

gcttattaca aatttcaagg cgctgaaaat tcttacaatt ttaaaaacac gaatttttta 2 040 

60 gcagggaatt tcaaattcca gggcaagacc accattgaaa aatccgtttt aaacgacgct 2100 

tcttacgctt ttgatggcgt gaataacgcc tttaatgaag acaaatttaa tggcggatcg 2160 

tttaatttca accacgcaga gcaaacaaac gcttttaata acaactcgtt tagtggcgga 2220 

tcgtttagtt ttaacgccaa gcaagtggat tttaatggga attcgtttaa tgggggggtg 2280 

tttaatttca ataatacccc taaagccagt tttactaacg acacttttaa tgtgaataac 2340 

65 caattcaaaa taaatggcgc tcaaacggat tttactttca gtaagggcgt tgttttcaac 2400 

atgcaagggc ttttgagcag tttgagcgta ggcacgactt atcaattgct taacgctaaa 2460 

agcgtgggtt ataaggataa caataacgct ttgtatcaaa tgttgcgctg gactagcgga 252 0 
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gaaaatccta gcggtaaatt agtagatgaa 
tataatgttc aattcactga taacggcttg 
gggatcacgc tcactcgttt atgcactcta 
gatgcgttta atcttaaaaa tgtcaataat 
5 ggcatgacga cttggaagac tgctggcaca 
aacagcgttt tagtgttcaa tcagaccacc 
aatagcgtgg tgggttttgg gaaaacttca 
caaggcgttt ttaaagccaa tcaaattgat 
gccaaaaccg gtgggggcgc gactttagtg 
10 aacgctaatt tgaataacga taaagccggt 
aataatggca atttgaacgt aacaaacgca 
tttaacctta aagccaataa tatcacttgg 
tttggtgtgg ataacgccaa cgctaatggg 
gataacggca ctttgattta taaagggggt 
15 gaaaacaaca ccttcaattc ctataatatc 
aacaactcgt ttaacagcgg tagctattcg 
ggcacgaaca cgctcattaa cagcgatcct 
gacaataata gtatctttaa cattgaaagg 
ttaagtgggg ataacatcaa atacaataac 

20 aatttatggg atttgatcca ttatgacggc 
aacacttatt ttgtgcaatt cacgcagagc 
tttaatcctg gctctatcac ctataaatat 
gaagctgatt ctaaggatat ttggaatcag 
aaaaccccgg tgtgcgttgg cgtgtgctat 

25 ggctctagcg cttttgcgtg gtcgctgaat 
ttagggagcg ctcaagaaaa agccaataac 
aatttgctgt atttgcatgg caattttaac 
aatgtcggca accctaacgc cggcggtggg 
ttgagcgctg acgggttgaa ttacacgaat 

30 agcgcgagcc agcattcatg ggcgaatttt 
tccaacttta gggatttcac atggggaggc 
gaaaacacca cttttagcgg ctggactaat 
tatgtgaaca tggttgcgaa tacggatttg 
attcgctatg atttgaaggc taataacatt 

35 gtgtctaaaa acgtgaatca gtcttcattg 
ctttcagfcca aacccaatgc ggctatcaat 
gaaaacgctt caagcctttc tttttataac 
gcttttaacg gggtgtctta cttgaattta 
gcgaatttca ataacgctaa tgtaaccttt 

40 aattttggca actctgtgcg ccttatcaat 
acgctcaatt taagggctaa aaatatccat 
aataactcfca cgatgaattt ggctgaaagt 
gagggggaaa cgaatttcaa tctcaacggc 
gfcttttaacg cccctgtgaa tttctacgct 

45 gcgactttta atgcagacgc ttcatttgat 
agcgttcttt taaacagcgc tctaaacctt 
aatgctaaag ggaattttag ttttggatct 
ctatttggag gggataaaaa agcttccgtt 
ggcttaaggg ggaataacgg ctatgagaag 

50 gccgattact cgttcaataa tggcgttcat 
actgaaacca ttaccgaaac cttacataat 
ggtgcttcta ataatgcgat gtttaatctc 
ccttatgatg aaagcgagaa ttcctataat 
ttgagtagca gtatcaaagg ctttggtaag 

55 caaaaccaac ccttacaagc tttacacatt 
aacatgatcg ccagtttggg taaggaattt 
ggggctttag acaatctcaa tctcaatagc 
ttaaaagaat atggcattac tttaaaccaa 
aataattttt ctaacacggc taattatcat 

60 atcaaagaag ggcaaacgaa cacgaatagc 
aatccatgcg cggttgggga taacacttgc 
ttgcttaatt ctagcgtgcc ttatttgggt 
atttatatca ccggaaccat cggcagtggg 
gtgtcttttg aaagcgcgac aaatttggtg 

65 accgataaga tcttttctta cttgggcaaa 
ggtttaggga atgtgctttc taatattgtt 
cctaaagatt tagccaacat gatccctaaa 



aataaaaccg cgccaaacag cgctaaaatt 25 8 0 

acttactaca ttaaagaaaa ttttaataat 2640 

ggctacacgc attgcgtgaa tattgataac 27 0 0 

aacgctagta acaccgtgtt ctatctcaac 2760 

ggagttttca cgcaagatta cagcggcact 2820 

ccttttcttg ctggggcgaa tcccacttcc 2880 

ggggctgaat gggggctagt gggctatatt 2940 

attaccggca cgattcgctc tggtaatgga 300 0 

tttaacgctc aagagcgttt gaatatcgct 3 0 60 

ttgcaaaact catggatgaa tttcattgtt 3120 

aattttagca accaaacccc gcatggaggc 318 0 

gataaaggct ctgtgagtgg gggggggaat 3240 

aatgcggtga ttaagaatgt taatttcagc 33 0 0 

gaaaacagcg ccggaaattc cctgacctta ,3360 

aacgccaaag cgcaaaacct tattttcaac 3420 

tttaacgaca ctaaaaatgt tacttttaaa 348 0 

ttcagccgcc ttaaaggatc agtttctatt 3540 

gatttgaccg ataaaaccac ttacacgctt 3 600 

caagctttag cggataatgt tttttcaaaa 3660 

gaacaaggga ctctattaag aacggataac 3720 

aacggccaaa aatttgtttt tgaagagact 3 78 0 

ttcactatcc attcttcgcc tttccacaca 3840 

gtgaggaagc agtttgattt tattccagga 39 00 

atcgcaccct ataaaaatca agatcttatc 3960 

tttggggcta cggtggtggg gactttgctt 4020 

aatggcggct cgatctggtt tggtaagaat 4080 

gcgactaata tctttttaac gaataacttt 4140 

gcaacgatta attttaacgc tgatgaaacc 42 00 

ttccaaaccg tggctatggg cttacaaact 42 60 

aattccaagc tttctatgga gattaaaaac 4320 

tttaggttca attcagggcg tatcactttt 438 0 

attaacgggg cgactgaaag cggttcatcg 4440 

attttcactg attccatttt aggagggggc 450 0 

attttcaata acactcaaat ggtggttgat 4560 

aatgggaatg ttactttcaa tcattccagg 462 0 

attggggggg atcagaccca aacgactfcta 4680 

gatagcgtag cgaattttaa cggcacgacc 4740 

aaccctaacg ctcaagtcag cttcaatcaa ' '48 00 

tatggcattc cgctatttgg taaaacgccc 4860 

ttcaaagggg acgcaaagtt taatcaagcc 4920 

ttgaatttcc aaggggcttc cacttttgaa 4980 

tctcaagcga gctttaacgc tcttagcgtg 504 0 

tcaagtttat tgagt-ttcaa tggtaacagc 5100 

aataattctc aaatttcttt cactcattcg 5160 

ttaggcaata "acagcaccct gaattttcaa 522 0 

ttaggcaatg gcggtaacaa tctagcgatt 52 80 

caagggattt tgaatctgtc ttatatgaat 5340 

tatgatgtgt tgcaagccca aaatattgat 5400 

atccgttfctt atggcataca gattgaaaag 54 60 

tcttggagct tcactaaccc gctcaacacg 5520 

aaccgcttga aagtgcagat ctctcaaaac 55 80 

gctcctagct tgtatgatta ccaacaaaac 5640 

cacacaagcg ataaagccgg cacttattat 5700 

aataatgaaa tacccgggac ttataacgcg 5760 

tataatcagg ctatcagtaa gcaggatttg 5820 

ttgcctaaag tggctaaact tatcgcttca 58 8 0 

ccggatagct ttgaaacgat ttttagtatc 5940 

gcgaattgga agagcttatt gaagatcatc 60 00 

ttttctcaag gtagtctcgt tgtgggggcg 6060 

gtggtgtggt ttgggggcga tgggtataaa 6120 

cagatgttca ggcagactaa tttagggcag 6180 

tacattaacg ctaattttaa ggctaaaaac 6240 

aacgcttggg ggagcggagg gagcgcgaat 63 00 

cttaatcaag ccaatattga cgctcaaggg 63 60 

gagggcattg ataagctttt tggagaaaaa 642 0 

tatgaagaga gcttgaatga taacgctatc 6480 

gatttgggat ctaaaacttt aagctctttg 6540 
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cttagcccta ctgaagtgaa taacctctta ggcgtgagtg cttttaaaaa cgcgatcatg 660 0 

gaaatcttaa attctaaaac ggtgggcgat gtttttggtg aaaacgggct tfctaaacgcg 6660 

ctagatcctg taaaaagaaa agaaattgat caaatgcttt tagagcaaat ccaagcccat 6720 

tcttcagggt ttgaaaaatt catcgttaaa actttaggga ttgaaaatgt agagaatttc 6780 

5 atcaataact ggtatggcaa gcaaagcttg agttcttttg ccaataattt tgtgcctgga 6840 

ggcttgaatc aagccctaga taaaataggt tctagctctg atgccaaaga cttacagagt 6900 

ttcttagata aaacgacttt tggggatatt ctcaatcaaa tgatcaatca agccccttta 6960 

atcaataagc tcatttcttg gctcggcccg caggatttga gcgtgttagt gaatatcgct 7020 

ttaaatagta tcactaaccc tagtaaggaa ttattgggtg cgatttctgg catgggtcaa 708 0 

10 aaagtgctga acgatttgct aggcgaaggt gtagtgaata aaatcatgag caatcaagtt 7140 

ttagggcaaa tgatcaataa aatcatcgct gataagggct ttggaggcgt ttatcatcaa 72 00 

ggtctaggct caatactccc taaatcctta caagatgagt tgaagaaatt gggtatgggc 7260 

tctttactca aacctaaagg cttgcacaac ctctggcaaa aagggaactt caatttcgtg 732 0 

gctaaaaacc atgtgtttgt- gaataacagc ttgtttagta acgccacagg gggggaattg 73 80 

15 aattttgtag cgggcaagtc cattattttt aatgggaaaa acaccattaa tttcacgcag 7440 

tatcagggca ggctttcttt tgtatctaaa gatttttcta atatttcatt agacacttta 7500 

aacgctacca acggcttaac gcttaacgct tctaaaaatg atafctagcgt tcaaaaaggt 7560 

caaatttgcg tgaatgtctt agattgcatg accgctaaag ggaaaaccac tcaaactaat 7620 

tcctcttcaa gcgcgacagc cccaactaat gaaacgctag aagtgagtgc gaataatttc 7680 

20 gctttcttag gcaccattaa ggctaatgga ttagtggatt tttcaaaagt cttacaaaat 7740 

acgactatcg ggactttgga tttagggcca aacgctactt ttaaagcgaa taatttgatc 78 00 

gtgaataacg cttttaataa taactctaat tacagggcta atatcagcgg taatttcaat 7860 

gtggccaagg gtgcaacttt tagcacgaat gaaaatggtt tgaatgtggg ggggaatttt 7920 

aacagcgaag ggccattaat ctttaatctt aataacccca cccatcaaac gattatcaat 7980 

2^ gtaactggca cttctacaat catgtcttat aacaatcaag ctttaatcaa ctttaacacc 8040 

caactcaagc aaggcgctta tacgcttatt aacgctaatc gcatggttta tggctatgat 8100 

aatcaaacga ttcttggggg gagcttgagc gattacctca aactttacac tctcattgat 8160 

tttaacggca aacgcatgca attgaatggc gattcgttaa gctatgacaa ccaaccggtc 822 0 

agtattaaag atgggggtct tgtggtaagc tttaaagaca atcaagggca aatggtgtat 8280 

30 tcatctatcc tttatgataa aatccaagtt accgtctctg ataaacccat gagcattcaa 8340 

gcccctagtt tggagtatta tgttaaacgc attcaaggta gtgctggttt gaatgcgatc 8400 

aaatctgcgg gcaataattc cattatgtgg ttgagtgagc tttttgcggc taaggggggt 8460 
aatcccttgt tcgcccctta ttatttgcaa gacaatccca ctgaacacat tgttacttta ' 8520 

atgaaagata ttaccagcgc tttaggaatg ctctctaact ccaatctcaa aaacaactcc 8580 

35 actgatgttt tacagcfccaa cacttacacg caacaaatga gccgtttagc caagctttct 8640 

aatttcgctt cttttgattc aacggatttt agcgaacgct tgagcagtct taaaaaccaa 8700 

agatttgccg atgctgtccc taatgcgatg gatgtgattt taaaatactc tcaaagggat 8760 

aaactaaaaa acaacctttg ggcgaccggt gttgggggcg tgagctttgt ggaaaatggc 8820 

acaggaacgc tctatggtgt caatgtgggc tatgatcgct ttgttagagg ggtaattgtt 8880 

40 ggagggtatg cggcttatgg gtatagcggg tfcttatgagc gcatcactag ttctaaatcc 8940 

gataatgtgg acgtgggttt gtatgctagg gcttttatca aaaagagcga gctgactttt 900 0 

agcgttaatg aaacttgggg ggctaataaa acccaaatca gctccaacga cgctttgctt 9060 

tctatgatca atcagtccta taaatacagc acatggacaa cgaacgcgaa agttaattac 912 0 

gggtatgatt tcatgtttaa aaacaaaagc atcattttaa aacctcaaat tggtttaagg 9180 

45 tattactata ttggtatgag cggtttagaa ggggtgatga ataacgtgct ctataaccag 9240 

tttaaagcga acgccgatcc gtctaaaaaa tccgttttaa cgattgattt tgctttggag 93 00 

aaccgccatt atttcaacac aaactcttat ttttatgcga ttggtggcgt tggtagagac 93 60 

ttgttagtta attctatggg ggataaattg gtgcgtttta ttggtaacaa cactttgagt 9420 

tacaggaaag gcgatcttta taacactttt gcgaacatca ctacaggcgg ggaagtgagg 948 0 

50 ttgtttaaaa gcttttatgc gaacgctggg gtgggggcta ggtttggatt ggattacaaa 9540 

atgatagata ttataggaaa tattggaatg cgtttagcgt tttaa 9585 
<212> Type : DNA 
<211> Length : 9585 

SequenceName : SEQ ID 474 

55 SequenceDe script ion : 

Sequence 



<213> OrganistnName : Helicobacter pylori, strain J99 

60 <400> PreSequenceString : 

atgaaacaat ttaaaaagaa accaaaaaag ataaaacgat cgcatcaaaa tcaaaaaaca 60 

atcttaaagc gtcctttatg gcttatgcct ttactgattg gcgggtttgc tagtggggtg 120 

tatgcggatg gaacagacat tttggggctt agttgggggg aaaaaagcca aaaggtatgc 180 

gtgcatcgtc catggtatgc tatatggagt tgcgataaat gggaggaaaa aacacaacaa 240 

65 tttacaggaa accaactcat cacaaaaact tgggcagggg gtaatgcggc taactactac 3 00 

cactctcaaa acaaccaaga catcacagcc aatttaaaaa atgataacgg cacttatttt 3 60 

ttaagcggtc tgtataacta caccggaggg gaatataatg gggggaattt agacattgaa 420 
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ttaggcagta acgctacttt taatctaggt 
tatcctaatg ggcatactga tgttactttt 
gtagaagtgg gcaatcgtgt gggatcggga 
aacttgaacg ctaataaggt tactatcaat 
5 gtgaatgtag gcaatgctaa cagcgttatt 
acttgcagtt ctttagctag ggtgggcgta 
tattctttta aagggacgac taacgctact 
ttcacttttg aagagaacgc cacttttagc 
ttcaataaaa agtttaacgc taccaataat 

10 aaaggcacaa gctcttttaa tggtgcgaat 
caagccactt tccaaaacag ctcctttaat 
aatcaaagca cccagcaccc ccaaattcaa 
cttaagggtt ttgcgacttt tgagcaagcc 
caaaacgctt cctttaataa cgctactttc 

15 gatgcgagct ttaataacac ttcgttcaac 
agtggtggcg ttactttaag cggtaaaaat 
gggagttcta aaatcactct cactcaaggg 
gagaagagcg taacgatttt aaattctaga 
catgcgatca atagcttgac aaacgcccta 

20 agtttcgctc aaggtttgtg ggatatgatc 
aatgaaaacg ctgcaacatc taaacccact 
tctacgcaag tctatcaagt gggttacaaa 
actttcagcc acaattccat tattattcag 
cctgtcatta acggctccaa attfcgactta 

25 ccttggtata accataaata ttatattcct 
tattacttgc cgagcgttca aatatggggg 
agcgcaagta atagcaatct ggtgattggg 
tcttctagcg acacggtggc ttttggggac 
gggccttggc cctattacca atgcacaggc 

30 gtctatatca cagcgaatct gcgttctggc 
ctaatcttta atggggtaga tagtatcaat 
gccggggctt attcaagctc tatgactttt 
ttgaatggcc taaattctaa cggcaagctt 
gccaaagacg ggaaattcat tttcaatgca 

35 aatggaggga gttaccaatt cagcggcgat 
aatagcggtt cgtttgagat tggcgcaaaa 
aacagcactt cttttaattt caataattct 
actaacgcta attcaaattt gcaaatcgct 
ggctctcaaa ataccgctaa ttttaataat 

40 acctttgata acgtggtatt taacagccct 
ctcaataaca tcactttaaa aaacttgaac 
gtttttagcg ctcattcggt gattaatatt 
acccttgtaa gctcttctaa agcaattgaa 
cagctcatca actaccaagg gcatggggct 

45 aatggcgtct atgatgtggt gtattctttc 
ttttcaccca acagcatttc tatccggcgt 
gatatggaaa aatcggatcg tttgtattat 
cctaatagct ataacaataa tttagggaat 
agcattgact tttatgcgag cgggaaaact 

50 ttcactgggc aaaacagcgc gatcgttttt 
gatgcgccgc aatctaatgt gatcattcgc 
gatgcgagtg ggcattgctg gaatttgcaa 
gcgcaaaaga tttacatcac cggcagtatt 
ggcgcgagcc ttaattttaa cgggcttcaa 

55 aaccgcgccg ctggcacgca aagctcttct 
caggctcaaa actcctattt tatagacgat 
agtttcaacg ctttgaatct ggatttttct 
acgcagtctg tttttaaatfc caatgccgtt 
ttaagctctg gtttgtatca aatgcaagct 

60 agcgtttcag tggggacaag cagcattaaa 
tctatcaatg cgagcaacca ttcaacctta 
accagctcgc tcaacctcaa ccaaagcgcc 
gattatgcga gcttgattgc gagtaatggc 
ttcaattcag cgaatattac tacgagtttg 

65 gtctctttac gagggcagtt taatttaagc 
agcgctatca cctctaacac ggcgtttaat 
atcactttcc atcaagccct tgacattaaa 



gcgagtagtg ggaatagctt cacttcttgg 480 

agcgctggga ctatcaatgt gaataacagc 540 

gctggcacgc acaccggcac agccacttta 600 

tccaatatca gcgcgtataa aacttcgcaa 660 

accattaatt cggtttcttt aaatggggat 720 

ggggctaatt gctccacttc tgggcctagc 780 

aacacgactt ttagcaattc aagcggcagt 840 

ggggcgaaat taaatggggg ggcattcact 900 

accgctttta atagcggtag ttttactttt 960 

tttagtaacg cttcctatac ttttaataat 1020 

ggggggactt ttacttttaa tgaccagacc 1080 

aacagctctt ttagcggcag tgctaccact 1140 

tttaacaatt caaaccacca actaacgata 12 00 

aacaataccg gtaaaatcac tatagaaaaa 1260 

actcctgttg atacaaacaa catgactatt 1320 

gacttgaaaa atggtgcaac ccttgatttt 13 80 

acgactttca acctcacaag tttaggcagt 1440 

ggtgggatca cttacaatca tcttttaaac 1500 

aaaacgaacg aaagctcttc aaaaccgcaa 1560 

acttacaatg gggttaccgg gcagcttttg 1620 

gactcttcgc cctctaaatc ctctacaaac 1680 

ataggggata ctatctacaa actgcaagaa 1740 

gctttagaga gcgggactta cacgccaccc 18 00 

tccgcttcaa attatatcaa tgctgacatg 1860 

aaatcccaaa attttacaga gagcgggact 1920 

agctacacta actcgtttaa acaaaccttt 1980 

tataacgcaa catggactga tcacaatgtt 2040 

acttcaggga gcgctcttaa tgggcattgc 2100 

acgactaacg gcacttatag cgcttatcat 2160 

aatcgtatag gcaccggtgg ggcagccaat 2220 

atcgctaacg ctaccatcac gcaacataac 2280 

tccacgcaaa acatggacaa ttcgcagaat 2340 

ttggtgtatg gcacaacttt cactaaccaa ■ 2400 

gggcaagcga cttttgaaaa caccaacttt 2460 

agcttgaatt tttcaaataa caaccagttc 2520 

aatactattt ttaataacgc taattttaac 2580 

agcgcgacca cttcgtttgt gggggatttc 2640 

gggaacgctg tttttgggaa ctctactaat 270 0 

accggctctg ttaatattgc agggaatgca 2760 

acgaacacga gcgtgaaagg gaaagttact 282 0 

getcctttgt cttttggcga tgggacgatt 2880 

ggtgaagcta tcacaaatgg caaccctatc 294 0 

tacaacgacg ctttcagtaa aaatctatgg 3000 

agcagtgaaa agctcgtttc tagtgcgggt 3 060 

aacaaccaaa cctacaattt ccaagaggtt 3120 

ttgggcgttg gcatggtgtt tgattatgtg 3180 

caaaacgctc tcggttttat gacctacatg 3240 

ttaaacaaca ccatttacta ttacgacaac 33 00 

ctattcacta aagcggaatt ttctcaaacg 33 60 

ggggctaaaa atatatggac gagcgtaagc 3420 

tttggggaca ataagggagc agggagtaat 3480 

tgcataggct ttatcacagg gcattatgaa 3540 

gaaagcggga accgcatttc tagcggtggg 3600 

ggcattcttt taacgaacgc gactttgtat 3660 

atgaattttg tttctaacag cgcgaacatt 3 720 

accgcacaaa ataaaggcaa ccctaatttt 3780 

aacagctctt ttagaggcta tgtggggcaa 3840 

aatgcgatca gtttcactaa cagctctaat 3900 

aaaagcgtgt tgtttgacaa ttccaattta 3960 

gccaatgcga tcaatctttc tcaaaacgcc 4020 

gaacttcaag gcgatttgaa tttgaacgac 4080 

attaatgttt ctaacaacgc cacgatcaac 4140 

tctcacctta attttaacgg ggcggttaat 4200 

agtagttcct ctatcgtgtt taagggggcg 4260 

aataattctt ctttagattt tcaaggctct 4320 

ttctatgata acgctttttc tcaaagcccc 4380 

gtgcccttga gtttgggagg caacctctta 4440 
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aaccctaaca acagtagcgt gctgaattta aaaaacagcc agcttgtttt tagcgatcaa 450 0 

gggagcttga atatcgctaa cattgattta ctaagcgatc tgaatggtaa taaaaatcgt 4560 

gtgtataaca tcattcaagc ggacatgaat ggtaattggt atgagcgtat caacttcttt 4620 

ggcatgcgca ttaatgatgg gatttatgac gctaaaaacc aaacttatag tttcactaac 4680 

5 cctctcaata acgccctaaa aatcaccgag agctttaaaa ataaccaact gagcgttacg 4740 

ctctctcaaa tcccgggcat taaaaacacg ctctataaca ttggctctga aatctttaac 4800 

taccaaaagg tttataacaa cgctaatggc gtgtattctt atagcgatga cgcacaaggc 4860 

gtgttttatc tcacgagcag cgtgaaaggc tattacaacc ccaaccaatc ctatcaagcc 4920 

agcggcagca ataacaccac gaaaaataac aatctaacct ctgaatcttc tgtcatttcg 498 0 

10 caaacctata acgcgcaagg caaccctatc agcgcgttac acgtcfcataa caagggctat 5040 

aatttcagta atatcaaagc gttagggcaa atggcgctca aactctaccc tgaaatcaaa 5100 

aagatattag ggaatgattt ttcgctttca agtttgagca atttaaaagg cgatgcgcta 5160 

aaccagctta ccaagctcat cacgcctagc gattggaaaa acattaacga gttgattgat 5220 

aacgcaaaca, attcggtcgt gcaaaatttc aataacggca c&ttgattat aggagcgact 5280 

15 aaaatagggc aaacagacac caatagtgcg gtggtttttg ggggcttggg ctatcaaaag 5340 

ccttgcgatt acactgatat tgtgtgccaa aaatttagag gcacttattt ggggcagctt 540 0 

ttggagtcca tctcggctga tttgggctat attgacacga cttttaacgc taaagaaatt 5460 

tatcttaccg gcactttagg gagcgggaac gcatggggga ctggggggag tgcgagcgta 552 0 

acttttaaca gccaaacttc gctcattctc aaccaagcga atatcgtaag ctcgcaaacc 558 0 

20 gatgggattt ttagcatgct gggtcaagag ggcatcaata aggttttcaa tcaagccggg 5640 

ctcgctaata ttttgggcga agtggcaatg caatccatta acaaagccgg gggattaggg 570 0 

aatttgatag taaatacgct agggagtgat agcgtgattg gggggtattt aacgcctgag 5760 

caaaaaaatc aaaccctaag ccagcttttg gggcagaata attttgataa cctcatgaac 582 0 

gatagcggtt tgaacacggc gattaaggat ttgatcagac aaaaattagg cttttggacc 588 0 

25 gggctagtgg ggggattagc cggactgggg ggcattgatt tgcaaaaccc tgaaaagctt 594 0 

ataggcagca tgtccatcaa tgatttattg agtaaaaagg ggttgttcaa tcagatcacc 60 0 0 

ggctttattt ccgctaacga tatagggcaa gtcataagcg tgatgctgca agatattgtc 6060* 

aagccgagcg acgctttaaa aaacgatgta gccgctttgg gcaagcaaat gattggcgaa 6120 

tttttaggcc aagacacgct caattcttta gaaagcttgc tgcaaaacca gcagattaaa 618 0 

30 agcgttttag acaaagtctt agcggctaaa ggattagggt ctatttatga acaaggtttg 6240 

ggggatttga tccctaatct tggtaaaaag gggattttcg ctccctatgg cttgagtcaa 63 0 0 

gtgtggcaaa aaggggattt tagtttcaac gcgcaaggca atgtttttgt gcaaaattcc 63 60 

actttctcta acgctaatgg aggcacgctc agttttaacg caggaaattc gctcattttt 6420 

gccggaaaca accacatcgc tttcactaac cattctggaa cgctcaattt gttgtctaat 6480 

35 caagtttcta acattaacgt caccatgctt aacgctagca acggccttaa gattaacgcc 6540 

actaataaca atgtttccgt gtctcaaggc aatctgttta tcaacgctag ctgcgtgcaa 660 0 

caaagcgatc caacgacagc tagcgccaca aacccttgca ccaccgctca aaataacgct 6660 

tcttctagta atgcgtcaaa caacgcgcca atcgccttaa ataataacga tgaaagcttg 6720- 

gtggttacgg cgaatggttt caatttttca ggcaatattt acgctaacgg ggtggttgat 6780 

40 ttttcaaaaa ttaaaggctc tgcaaacgtt aaaaacctgt atctttacaa taacgctcaa 6840 

ttccaagcca acaacctcac gatttccaac caagcggtat tagagaaaaa cgctagcttt 690 0 

gtaacgaata acttaaacat tcaaggagcg tttaacaaca acgccacgca aaaaatagag 6960 

gtgcttcaaa atttagtgat cgcttcaaac gcttctttaa gcaccgggat ttatgggtta 702 0 

gaagtagggg gggcattgaa taatttggga gcgatccatt ttaatttaga aaattctcaa 708 0 

45 acgcctgtaa atccgctcat tcaagtaggg gggatcatta atctcaacac cacccaaacg 7140 

ccttttatga atgtcagcgt ggctaatggc ggaacttaca ctttattaaa aagcagccgt 72 0 0 

tatattgatt acaatatcaa ccctaacagc ttgcaatcgt atttgaagct ctatacctta 7260 

atcaatatca acggaaacca catagaggaa aaaaacggcg tattgactta tttgggccaa 7320 

cgggttttat tacaagataa ggggttatta ttgagtgtag cactacctaa ctcaaacaac 73 80 

50 gcctctcaaa acaacatttt aagcctttct gtccttcaca accagattaa aatgtcttat 7440 

ggtaataaag tgatggactt tacccctccc accttacagg attacattgt gggcattcaa 750 0 

ggacaaagcg cactcaatca aattgaagct gttgggggga ataacgctat caagtggctt 7560 

tcaacattga tgatggagac taaagaaaac ccgctttttg cgccgattta tttagaaaac 7620 

cactctttaa atgaaatctt aggcgtaaca aaagatcttc aaaacaccgc aagcttgatt 7680 

55 tctaacccta attttagaaa taacgctacc agccttttag aaatggcgag ttacacccaa 7740 

caaaccagcc gtttgacaaa actctctgat tttagggcta gagagggaga gtccaatttt 78 00 

tcagagcgct tgttagagct taaaaacaag cgttttagcg atcctaaccc tagtgaggtt 7860 

tttgtcaaat actctcaact cagcaaacac cccaataacc tttggattca aggggtggga 792 0 

ggagcgagct ttatttctgg gggcaatggc acgctttatg gcttgaatgt gggctatgac 7980 

60 cgattggtta aaagcgtgat ccttgggggt tatgtggctt atggctatag cggttttaac 8040 

gggaacatca tgcattcttt ggctaataat gtggatgtgg ggatgtatgc gagggctttt 8100 

ttgaaaagaa acgaattcac tttgagcgcg aatgaaactt atggaggcaa tgcgagtcat 8160 

atcaattctt ctaattcctt gctctctgtg ttgaaccaac gctacaacta caacacctgg 8220 

acaacgagcg tgaatgggaa ttacggctat gatttcatgt tcaaacaaaa aagcgtggtg 8280 

65 ctaaaacctc aagtgggctt gagctatcat ttcataggct tgagcgggat gaaaggtaaa 8340 

atgcaaaatc cagcttacca acaattcgtc atgcattcaa acccttctaa cgaatcggtt 8400 

ttaacgctca acatggggtt agagagccgt aaatattttg gtaaaaattc ctattatttt 8460 
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gtaacggcga ggttgggtag ggatcttttg atcaaagcta aaggcgacaa tgtggtgcgt 852 0 
tttgtgggtg aaaacacttt attgtaccgc aagggggaaa tttttaacac ttttgcgagc 858 0 
gtgatcacag gaggcgaaat gcatttgtgg cgtttgatgt atgtgaatgc gggggtgggg 8640 
cttaaaatgg gcttgcaata ccaagatctt aatatcactg ggaatgtggg catgcgagtg 870 0 
gcgttttag 8709 
<212> Type : DNA 
<211> Length : 8709 

SequenceName : SEQ ID 475 

SequenceDescription : 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<4 00> Pre Sequences t ring ... 

15 atggcgttta aaaaggccag attgatttcc aggtttattt caaaaggatc tttcaaattg 60 

aataagatct caaagaaatt tfctcacattg aatcaaatct taaagcgtga aaagccctta 120 

aaacgccata aaaaaacaaa atctattgaa aagcccttta ataaaaacaa atctttttta 180 

aaagcttcgg ttttattgat aggagcgcta ggggggttat cccacctaag ggctaacgaa 240 

tgccgttatt ggtcatggtc gtcttggagt tatcaagaca atattgaaag cggtcctaat 3 00 

20 tcacccacgc acaactctta ttgtcttttt agtagcgctc aaggctctgg gacttattat 3 60 

ttaaacactc ttaccactta tagcgctggt ggggctagtt tcacgcaaaa attcaatggt 420 

ggcacgcttg atataggggg gaatatccgc tttggaggca caggtattaa tggaggtgat 48 0 

gtagggtata tcactggaac ttataatgct caaacgatga attttaattc tagccatatc 540 

acaaccggaa actcatacgc tgatggcggt gggaccacgc tcaattttaa cgcgactaac 600 

25 aatatcacta tcaatcaagc gagctttgat aacagcgatg cagggacaca aaaatcttac 660 
atgaatttta aaggctctaa tatcaagatc agtggctcta gctttacaga cgacaccaat 720 

ggaggtttta atttcagcgg taataacaat aatagcacca tctctttcaa tcaaaccagc 780 
ttcaatcaag ggacttataa ttttagtaac agcgccactt taagcttcaa taacagcaat 84 0 

ttcaatcaag ggacttatca ctttaacagc gcccaatcca cttttgaaaa cagcaatttc 900 

30 aatcaaggca cttataattt taatgacaat actagcttta ataacgacac cttcaatcaa 960 

ggcacttata attttaatag cagcaaggtg agtttttcag gcgctaacac tttaaattca 1020 

agttcgcctt ttgctagcct taaaggcagt gtgtctttta attctggtgc gatttttaac 1080 

ctcaatcaaa cccttaataa taatcaaacc tatgacattc tcactacaaa cggagcgatc 1140 

cagtatgggg tttatcaaag cfcatttgtgg gatctaatca actataaggg cgataaagcc 1200 

35 attagccatg ttgaagtgag taataacact tatgatgtaa cctfctgacat taacgggcaa 1260 

gatgaaacct tacaagaaac ctttagcaac caatctatta ttacccaatt tttaggagac 1320 

gatttacaac aacaagccca acaaacctat caagaggatg tagctaattc ccagaacgct 13 80 

ttgaataagg ttgctagcga caacacgatc gcaaataacg atacaagcta cactcaaagc 1440 

agtaacccca ctatccttaa agacgctcaa ggtttagaaa acaccaacca acaaatccaa 1500 

40 caagacgaaa aagccttaga aaaagattta gcccaaatca agcaattagc caactccacc 1560 

acaggcttta acgaacaagc tttcactcaa gctcaaaaac aagaacaaca agatgaacaa 1620 

gccttacaga acgatgaaaa cgcttttaat acggaacaag agggattaga acaagcgata 1680 

gctaacgcta aacatgccaa ccccacacca aatccgacac caagccccac acccactcct 1740 

ataaaacaca cagcgccaaa cactccccct agtcaagtcc cgcccacacc ccctagtcaa 18 00 

45 aatttaccta aaacaaatgt gtggaatggg gtttattggc ttcaaaacaa aacttactca 1860 

aacaaaggca tttattatat tgatcccaat ctttcaggac agagcggtca aagcggcaac 192 0 

acgctcagca cctatacagc taatttgtta gggagaagtt ttggcgtcaa tgctaacaat 1980 

ggcactttga tcatagggaa taatacagag agtgtgaatg ataacgggtt gatttggata 2 040 

gggcatggag gctttggcta tattacggga acttttagtg cggctaacat ttacttgacc 2100 

50 aataatttta aaaccggtga aggcgtttca aattcagatg gtgggggagc gaacattacc 2160 

tttaaagcaa gcgataatat cactatggat ggcttgaatt acaataacgc tgaaaccgtt 222 0 

actaaaatga ttcaaacagg ggccagtcag cattcctata ccacttttga cgctaccaat 228 0 

aatatcagtg taactgattc tgattttagc gatatgactt gggggaaatt cagttttagc 2340 

gctaagaata tttcgttttc taacgcttcg ttcagcggct ttacaaaccc tggaggatca 24 0 0 

55 agcactatca gcacgaatgc ttctaattct ttaagcttta cagattctcg cttgaatggt 2460 

ggagcaatct ataatttaca ggctaatagc cttattttca ataacacgca agcggttttt 2 520 

aatgtcttgt attctagggg gacaagcaat tttaacgcca ccacacagct, tttaggcaac 2580 

acgagtttta cgcttagctc tcaaagtttg ctaaacttta atggcgatac aaccttgcaa 2640 

aacaacgcta atatcacgct tggcaataaa agtcaagccg cttttaaaaa ttcfcttaacg 2700 

60 cttgataaca attctaattt aagcttagac aatcaaagcg ttttgaacgc gaatggcacg 2760 

agtgctttta acaatcaagc gagtctcaac atttataatg ggagtcaagc ggcctttagc 282 0 

agtctctttt ttaatggcgg aacactcagt cttaacgcga atagcaagct caacgcttct 2880 

agcgctagtt tttcaaacaa caccactatt aatttagacg atagcgtttt gaatgcgaat 2940 

aacacaagct ctttaaacgc taatatcaat tttcaaggcg caagccaggc tgattttgga 3 00 0 

65 ggcaacacga ctattgatac agcaagcttt aattttgaca gcgcaagttc attgaatttt 3 060 

aataacctta cggctaatgg ggcgttaaat tttaatggtt atgcgccctc tttaactaag 3120 

gctttaatga atgtcagcgg gcagtttgtt ttagggaata atggggatat taatttatct 3180 
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gacatcaata tctttgacaa catcacaaaa tctgtaactt acaacatctt aaacgctcaa 3240 

aaagggatta ctggcattag tggggctaat ggctatgaaa aaatcctttt ttatggcatg 33 00 

aaaatccaaa acgctaccta tagcgataat aacaacatcc aaacttggtc gtttataaac 3 3 60 

cctctcaatt cttctcaaat cattcaagag agcattaaaa atggggatct aaccatagaa 3420 

5 gttttaaata accctaactc ggcttccaac actattttta atatcgctcc tgagctttat 34 80 

aattaccaag attctaagca aaatcctacc ggctatagct atgattatag cgacaatcaa 3 540 

gcaggcactt attacttgac aagcaacatt aaaggtcttt tcacccctaa aggctctcaa 3600 

acgcctcaaa ccccaggcac ttatagccca tttaaccagc ctttgaatag tttgaatatc 3660 

tacaataagg gtttttctag cgagaattta aaaacgcttt tagggatcct ttctcaaaat 3720 

10 tccgccacct taaaagaaat gattgaatcc aaccaactag acaatatcac fcaacattaat 3780 

gaagtgttgc aactcttaga taagattaaa atcacccaag cgcaaaagca agcgctccta 3 840 

gaaacgatca accatttgac tgacaacatc aatcaaacct ttaataacgg gaatctcgtt 39 00 

ataggcgcta cccaagataa tgttacaaac tctactagct ctatatggtt tgggggcaat 3 960 

ggctatagca gcccttgcgc gctagatagc gccacttgtt cttcttttag aaacacttac . 4020 

15 ttggggcaat tattaggctc aacttcccct tatttaggct acattaacgc tgattttaaa 40 80 

gctaaaagca tttatattac cgggacaatt ggaagtagta acgcttttga aagcggaggg 4140 

agcgcggatg taacctttca aagcgctaat aacttagtgt tgaataaagc taacatagaa 4200 

gctcaagcca cagacaatat ctttaatctt ttgggtcaag aagggattga taaaatcttt 42 60 

aatcagggga atttagcgaa tgttcttagt caaatggcta tggaaaaaat caagcaagcc 4320 

20 ggcggtttag ggaactttat agaaaacgct ctaagccctt tgagtaagga attacccgct 43 80 

agcttgcaag atgaaacctt aggccaactt ataggtcaaa ataacttaga tgatttattg 4440 

aataatagtg gagtcatgaa tgaaatccaa aacattatca gtcaaaaact aagcattttt 45 00 

ggcaattttg ttaccccatc catcatagaa aactaccttg ctaagcagtc tttaaaaagc 4560 

atgctagacg ataaagggct tttgaatttt atcggtgggt atatagacgc ttctgaatta 4620 

25 agctctattt taggcgtgat tttaaaggat attactaacc cccctacaag cctgcaaaaa 4680 

gacattggtg tggtagcgaa cgacttgttg aacgagtttt taggacaaga tgttgtcaaa 4740 

aagctagaaa gtcaaggctt ggtgagtaat atcatcaata atgttatttc tcaaggcggg 4 8 00 

ttgagcggcg tttataatca aggtttaggg agcgtgttgc cgccctcttt acaaaacgcg 4860 

ctcaaagaaa acgatttagg cactctttta tcgcctagag gcttgcatga tttttggcaa 4920 

30 aaagggtatt ttaacttttt aagcaatggc tatgtttttg tcaataacag ctcttttagt 4980 

aacgctactg ggggtagttt gaattttgtc gccaacaagt ctattatctt taatggcgat 5040 

aatacgattg actttagcaa gtatcaaggc gcattgattt ttgcttctaa tggtgtttct 5100 

aatatcaata tcaccaccct aaacgccact aatggcttaa gccttaatgc gggtttgaat 5160 

aatgtgagcg ttcaaaaagg agaaatttgt atcaatttag ccaattgccc tacaaccaaa 5220 

35 aacagctctc ctgcaaactc tagcgtaacc cccactaatg agtctttaag cgtgcacgct 5280 

aataatttca ctttcttagg cacaatcatc tctaatgggg ctattgattt gtctcaagta* 5340 

acaaataata gcgttatagg cacgctcaat ctcaatgaaa atgcgacctt gcaagctaat 5400 

aatttaacga tcaccaacgc ttttaacaac gcctctaact ctacggctaa tattgatggt 5460 

aatttcacct taaaccaaca agcgacttta agcactaacg ctagtggttt gaatgtcatg 5520 

40 gggaatttta atagctatgg cgatttggtg tttaacctca gtcattcagt tagtcatgct 5580 

attatcaata ctcaaggcac agcgacgatc atggccaata ataacccttt gatccaattc 564 0 

aacgcttctt caaaagaagt gggtacttac acgctgattg atagcgctaa agccatttat 5700 

tacgggtata acaaccaaat cacaggaggc agtagcctgg ataattacct taagctttat 5760 

gcgctcattg atattaatgg caagcacatg gtgatgactg acaacggctt aacctataac 5820 

45 gggcaagccg tgagcgttaa agatggcggt ttagttgtag gctttaagga ctctcaaaat 5880 

caatacattt acacttccat tctttataat aaagtgaaaa tcgctgtttc taatgatcct 5940 

atcaataacc cacaagcccc cactttaaaa caatatatcg ctcaaattca gggcgttcaa 60 00 

agcgtggata gcatcgatca agctggggga aatcaagcga ttaattggct caataaaatc 6060 

tttgaaacta aaggaagccc tttattcgct ccctattatc tagagagcca ctccacaaaa 6120 

50 gatttaacca cgatcgctgg agatattgct aacactttag aagtcatcgc taaccctaat 6180 

tttaaaaatg acgccactaa tattttacag atcaacacct acacgcagca aatgagtcgt 6240 

ttagccaagc tctctgacac ttcaactttc gcccgttctg atttcttaga acgcttagaa 63 00 

gcccttaaaa acaagcgatt cgctgatgcg atccctaacg ctatggatgt gattttaaaa 63 6 0 

tactctcaaa ggaatagagt taaaaataat gtgtgggcga caggagttgg aggggctagt 6420 

55 ttcattagtg gaggtactgg aactttatat ggtatcaatg tagggtatga taggtttatt 6480 

aagggcgtga ttgtgggagg ttatgccgct tatgggtata gcgggtfccca tgcaaacatc 654 0 

actcaatcag gctctagcaa tgtcaatgtg ggcgtttata gccgagcgtt tatcaaaaga 6600 

agcgagctaa ccatgagctt gaatgagact tggggataca ataaaacttt catcaactcc 6660 

tatgaccccc tactctcaat catcaatcag tcttacagat acgacacttg gacgactgac 6720 

60 gctaaaatca attatggcta tgafcttcatg tttaaagata aaagcgttat ttttaaaccc 678 0 

caagtaggct taagctatta ttacattggt ttgtctggtt fcaaggggcat tatggatgat 6840 

cctatttaca accaattcag agccaatgct gaccctaata aaaaatccgt tctaacgatc 690 0 

aattttgccc tagaaagtcg gcattatttc aataaaaact cttattattt tgtgattgcg 6960 

gatgtgggca gagacttatt cattaattct atgggggata aaatggtgcg tttcatcggt 7020 

65 aataacaccc taagctatag agatggtggc agatacaaca cttttgctag cattatcaca 7080 

ggcggggaga taagattgtt caaaaccttt tatgtgaatg cgggcatagg ggctaggttt 7140 

gggcttgatt ataaagatat taatattacc ggaaatattg gtatgcgcta tgctttttaa 72 0 0 
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<212> Type : DNA 

<211> Length : 7200 

SequenceName : SEQ ID 476 
SequenceDescription : 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 

10 <400> PreSequenceString : 

atggaaatac aacaaacaca ccgcaaaatc aatcgccctt tagtttctct cgttttagca 60 

ggagcgttga ttagcgccat accgcaagag agtcatgccg cctttttcac gaccgtgatc 120 

attccagcca ttgttggggg tatcgccaca ggcactgctg taggaacggt ctcagggctt 180 

cttagttggg gactcaaaca agccgaagaa gcgaataaaa ccccagataa acccgataaa 24Q 

15 gtttggcgca ttcaagcagg aaaaggcttt aatgaatttc ctaacaagga atacgactta 3 00 

tacaaatccc ttttatccag taagattgat ggaggttggg actgggggaa cgccgctagg 3 60 

cattattggg tcaaaggcgg gcaatggaac aagcttgaag tggatatgaa agacgctgta 420 

gggacttata aactatcagg gcttagaaac tttactggtg gggatttaga cgtgaatatg 480 

caaaaagcca ctttgcgttt gggccaattc aatggcaatt ctttcacaag ctataaggat 540 

20 agcgctgatc gcaccacgag agtgaatttc aacgctaaaa atatttcaat tgataatttt 600 

gtagaaatca ataatcgtgt gggttctgga gccgggagaa aagccagctc tacggttttg 660 

actttgcaag cttcagaagg gatcactagc agtaaaaatg cggaaatttc tctttatgat 720 

ggcgccacgc tcaatttggc ttcaaacagc gttaaattaa atggtaatgt gtggatgggc 780 

cgtttgcaat acgtgggagc gtatttagcc ccttcataca gcacgatcaa cacttcaaaa 840 

25 gttcaagggg aagtggattt taaccatctc actgtggggg atcaaaacgc cgctcaagcg 900 
ggcattatcg ctagcaataa gactcatatt ggcacactgg atttgtggca aagcgccggg - 960 

ttaaatatca ttgcccctcc agaaggtggc tacaaggata aacctaatag taccacttct 1020 

caaagtggca ctaaaaacga caagaaagag atcagtcaaa ataacaatag caacacagag 108 0 

gtcattaacc cacccaataa cacgcaaaaa acagaaactg aacccacgca agtcattgat 1140 

30 gggccttttg ctggcggcaa agacacggtt gtcaatattt tccacttaaa cactaaagcc 12 00 

gatggcacga ttaaagtggg agggtttaaa gcttctctta ccacgaatgc ggctcatttg 12 60 

aatatcggca aaggcggtgt caatctgtcc aatcaagcga gcgggcgcac ccttttagtg 1320 

gaaaatctaa ccgggaatat caccgttgat gggcctttaa gagtgaataa tcaagtgggt 138 0 

ggctatgctt tggcaggatc aagcgcgaat tttgagttta aggctggtgt ggatactaaa 1440 

35 aacggcacag ccactttcaa fcaacgatatt agtttgggaa gatttgtgaa tttaaaggtg 1500 

gatgctcata cagctaattt taaaggtatt gatacgggta atggtggttt caacacctta- 1560 

gattttagtg gtgttacaga caaagtcaat atcaacaagc tcatcacagc ttccactaat 162 0 

gtggccgtta aaaacttcaa cattaatgaa ttgattgtta aaaccaatgg gataagtgtg 1680 

ggggaataca ctcattttag cgaagatata ggcagtcaat cgcgtatcaa taccgtgcgt 1740 

40 ttggaaactg gcactaggtc aatcttttct gggggtgtca aatttaaaag cggtgaaaaa 18 00 

ctagttatca atgattttta ctatagccct tggaattatt ttgacgctag gaatgttaaa 1860 

aatgttgaaa tcaccagaaa attcgcttct tcaaccccag aaaacccttg gggcacatca 1920 

aagctcatgt ttaataatct aaccttgggt caaaatgcgg tcatggacta tagtcaattt 198 0 

tcaaatttaa ccattcaggg ggattttatc aacaatcaag gcactatcaa ctatctggtc 2040 

45 cgaggcggga aagtggcaac cttaaatgta ggcaatgcag cagctatgat gtttaataat 2100 

gatatagaca gcgcgaccgg attttacaaa ccgctcatca agattaacag cgctcaagat 2160 

ctcattaaaa atacagagca tgttttattg aaagcgaaaa tcattggtta tggtaatgtt 222 0 

tctacaggta ccaatggcat tagtaatgtt aatctagaag agcaattcaa agagcgccta 228 0 

gccctttata acaataataa ccgcatggat acttgtgtgg tgcgaaatac tgatgacatt 2340 

50 aaagcatgcg gtatggctat cggcaatcaa agcatggtga acaaccctga caattacaag 2400 

tatcttatcg gtaaggcatg gagaaatata ggcatcagta aaacggctaa cggctctaaa 2460 

atttcggtgt attatttagg caattctacg cctactgaga atggtggcaa taccaccaac 2520 

ttacccacaa acaccactaa taatgcgcat tctgctaact acgctctcgt gaagaacgct 258 0 

cctttcgctc acagcgccac tcctaattta gtcgctatca atcagcatga ttttggcact 2640 

55 attgagagcg tgtttgaatt ggctaaccgc tctaaagata ttgacacgct ctatactcat 2700 

tcaggcgcgc aaggcaggga tctcttgcaa actttattga ttgatagcca tgatgcgggt 2760 

tatgccagac aaatgattga taacacaagc accggtgaaa tcaccaagca attgaatgcg 2820 

gccactgacg ctttaaacaa cgtagccagt ttagagcata aacaaagcgg cttacaaacc 28 8 0 

ttgagcttga gtaatgcgat gattttaaat tctcgtttag tcaatctctc taggaagcac 2940 

60 accaaccata ttaactcgtt cgctcaacgc ttacaagctt taaaaggcca agaattcgct 3000 

tctttagaga gcgcggcaga agtgttgtat caatttgccc ctaaatatga aaaacctacc 3060 

aatgtttggg ctaacgctat tgggggagcg agcttgaata gcggctctaa cgcttcattg 3120 

tatggcacaa gcgccggcgt agacgctttc cttaacggga atgtggaagc cattgtgggc 3180 

ggttttggaa gctatggtta tagctccttt agcaatcaag cgaactctct taactctggg 3240 

65 gccaataacg ctaattttgg cgtgtatagc cgtttttttg ccaaccagca tgaatttgac 3300 

tttgaagctc aaggggcgct agggagcgat caatcaagct tgaatttcaa aagcactcta 33 60 

ttacaagatt tgaatcaaag ctataattac ttagcctata gcgccacagc aagagcgagt 3420 
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tatggttatg acttcgcgtt ttttaggaac gctttagtgt taaaaccaag cgtgggcgtg 3480 

agctataacc atttaggttc aaccaacttt aaaagcaata gccaatcaca agtggcttta 3540 

aaaaatggcg cgagcagtca gcatttattc aacgctaacg ctaacgtgga agcgcgttat 3 600 

tattatgggg acacttcata cttttatttg catgcgggag ttttacaaga gttcgctcac 3 660 

5 tttggatcga atgatgtggc gtctttaaac acctttaaaa tcaatgccgc tcgcagtcct 3720 

ttaagcacct atgcaagagc gatgatgggt ggggaattgc aattggctaa agaagtgttt 3780 

ttgaatttgg gcgtggttta tttgcacaat ttgatttcca acgcaagcca tttcgcttcc 384 0 

aatttaggaa tgaggtatag tttctaa 3 867 
<212> Type : DNA 
10 <211> Length : 3867 

SequenceNarae : SEQ ID 477 

SequenceDe script ion : . 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaaaaaac acatcctttc attaacttta ggatcgcttt tagtttccac tttgagcgct 60 

gaagacgacg gcttttacac aagcgtaggc tatcagatcg gtgaagccgc tcaaatggta 120 

20 acaaacacca aaggcatcca agatctttca gatcgttatg aaagtttgaa caaccttttg 18 0 

aatagataca gcaccctaaa cacccttatc aaattgtccg ctgacccgag cgcgattaat 240 

gcggtgcggg aaaatctggg cgcgagcgcg aagaatttga tcggcgataa agccaattcc 3 00 

ccggcgtatc aagccgtgct tttagcgatc aacgcggcgg tagggttttg gaatgtcgta 3 60 

ggctacgtga cgcaatgcgg gggtaacgcc aatggtcaaa aaagcatctc ttcaaagacc 420 

25 atcttcaaca acgagccagg gtatcgatcc acttccatca cttgttcttt gaacgggcat 48 0 

tctcctggat actacggccc tatgagcatt gaaaatttca aaaagcttaa cgaagcctac 540 

cagatcctcc aaacggcttt aaaacgaggc ttgcccgcgc tcaaagaaaa caacgggaaa 600 

gtcaatgtaa cctatactta cacatgctca ggggacggga ataataactg ctcgtcacaa 660 

gtcacaggtg taaataatca aaaagacgga accaagacta aaatccaaac catagacggc 72 0 

30 aaaagcgtaa ccaccacgat cagttcaaaa gtggttgata gtcgtgcaga tggtaataca 780 

acaggggtgt cctacaccga aatcaccaac aaattagaag gtgtgcctga tagcgctcaa 84 0 

gcgctcttag cgcaagcgag tacgctcatt aacaccatca acaacgcatg cccgtatttc 90 0 

catgctagta atagtagtga ggctaacgcc ccaaaattct ctactactac tgggaaaata 960 

tgcggcgctt tttcagaaga aatcagcgcg atccaaaaga tgatcacgga cgcgcaagag 1020 

35 ctggtcaatc aaacgagcgt cattaacgag catgaacaaa caactccggt aggcaataac 1080 

aatggcaagc ctttcaaccc tttcacggac gctagttttg cgcaaggcat gctcgctaac 1140 

gctagtgcac aagccaagat gctcaatcta gccgaacaag tggggcaagc cattaaccct 1200 . 

gagaggctta gcgggacttt tcaaaatttt gttaaaggct ttttagccac atgcaacaac 1260 

ccatcaaccg ctggtactgg tggcacgcaa ggttcagctc caggcacagt taccactcaa 132 0 

^0 actttcgctt ccggttgcgc ctatgtagga caaacgataa caaatcttaa aaacagcatc 13 80 

gcccattttg gcactcaaga gcagcagata cagcaagccg aaaacatcgc tgacactctg 144 0 

gtgaatttca aatctagata cagcgaattg ggcaacactt ataacagcat caccactgcg 150 0 

ctctctaata tccctaacgc gcaaagcttg caaaatgcgg tgagtaaaaa gaataacccc 1560 

tatagcccgc aaggcataga caccaattac tacctcaatc aaaactctta caaccaaatc 162 0 

45 caaaccatca accaagaact cgggcgtaac ccctttagga aagtggggat tgttagttct 1680 

caaaccaata atggcgcgat gaatgggatc ggtattcagg tgggttacaa acaattcttt 1740 

ggccaaaaaa gaaaatgggg cgctaggtat tacggctttt ttgattacaa ccatgcgttc 18 0 0 

attaaatcca gcttcttcaa ctcggcttct gacgtgtgga cttatggctt tggagcggac 1860 

gctctttata atttcatcaa cgataaagcc accaatttct taggcaaaaa caacaagctt 192 0 

50 tctgtggggc tttttggggg tattgcatta gccgggactt catggcttaa ttctgagtat 19 80 

gtgaatttag ccaccatgaa taacgtctat aacgctaaaa tgaacgtggc gaatttccaa 2040 

ttcttattca acatgggagt gaggatgaat ttagccaggc ctaagaaaaa agacagcgat 2100 

catgcggctc agcatgggat tgagttaggg cttaaaatcc ccaccatcaa cacgaactac 2160 

tactccttta tgggggctga actcaaatac cgaaggctct atagcgtgta tttgaattac 2220 

55 gtgttcgctt attaa 2235 
<212> Type : DNA 
<211> Length : 2235 

SequenceName : SEQ ID 478 
SequenceDescription : 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 
65 atgataaaaa aagctaaaaa attcatacca ttctttttaa ttggctccct cttagctgaa 60 
gacaatggct ggtatatgtc tgtaggctat caaatcggtg gcacgcagca attcatcaat 120 
aacaaacaac ttttagaaaa tcaaaatatc atcaatagca tcactcaaag cgcgatcaac 180 
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attgcagggc ctactaccgg ccttatcact ttaagctctc aaaccgtcat tgacgcttta 240 

ggctatggcg tgagtaacac tgttggcaac caattagagg gcatttctaa catcttgaat 3 00 

caaattggca aaagaaaaga cttttattct agccgtcaaa tctctagcat ttcccagcaa 360 

atcatagggc ttaaaggaag ctctgatccc ttaaaagccc attcttcaca aatcacagcc 420 

aaactccttt ccaacaccca aagcgcgttt gatcagggca tcgctctaag ctctaatatc 480 

attagtgcag tcaatagcct aaaccctagc aacaactccc aagaagtcaa agcccagctc 540 

caaaacaccg cgcaatccat ggcggaatta ttgcaacaaa ttgaacacag catcactaaa 600 

accactagca ccacttacgc acaatcctta ctctccaatc fcgaccgatgc ggtgaatgca 660 

tctagcaata ataccactta tgtgagcgct cttgttaacg ctttaaacac tttaggggtg 72 0 

ggggttttcc ccaccacaac ctcaacgcat gtggtgctaa acccaccggg acaagtcgta 780 

ttctatccaa ctaattccct tttaggctct acttcttcaa acagcaataa ccaacaacaa 840 

tacaacaaca cccttttaat gaacacctta caaggggaat taagcactaa caatcaaaat 900 

aaccccaatg gttgcgccaa tcaaatccag tgtttagagc aattcatcca aaatttaacc 960 

cctttagccg caacccccac ttcaactaac caggccaacc agcaagtcca agccatcgct 1020 

caaaaacttc aaagcgttgc tatcaacgct ttagacaaca atgcgatcaa caacaccacc 10 80 

tataatttaa acaacttgca caacgctttg aatttccaag cctatcaaag cacgatagaa 1140 

caatacaata acgctttaaa gcaaatttcg tggattagtt ttagcgagcc taaaaacttg 120 0 

ctcaaaaaca cttccaataa ctaccaaatc ggcacggtta ccaacgatca agggcaaaat 1260 

atcagcgcct atgattgcac aagcgctacc ggaagccttt ctagcgatgc ttctagtggg 132 0 

atttcatgct cagccacaag ctccacaaat aacacaaata gttttgacaa ttctttagtc 1380 

gctacctcca aagtccaaac catcaacggc aaagagcaga tcggcgtgaa ttcttttaat 1440 

cttgtctctc aagtgtggag cgtttataac tctttaaaaa cttcagaaga aaatttgcaa 1500 

aaaaacgcca aaatattatg caacaatgga tcgcaatctg ggacaagccc atgcaatagc 1560 

tcttcagggg gtttgagcat cagcgggaac gcccaattgc aaaatatttt aagccctact 162 0 

aatgggacta ccactaatac tcaagctaaa agcaacgctt ccaaactaaa agcgatggta 1680 

atggtgaata atgaagaaga agccaaaacg accaatttca atcaaagcag tgggccaacc 1740 

acacaatctt ctaacagcac ggtgatggga gctttaaaca ccgtattgca aaatgtcagc 18 0 0 

aatttccaac aaagcattca aagcgctttt caaaaccaag aaaataatat ccaagcttgg 1860 

gcgaacgcac tttataacac tagtaaccct aatgggaatc aatcgcaaaa tttaaccact 192 0 

aacaataacc aagatttacg catccaatta agggcgaatt tttaccagct catcaatacc 198 0 

attaaccagc aagtgcctac agacatgaac gctttaatta atcaaagcca acaaacccag 2040 

caaacaagcg gafccagcaag caccacgaac aacgcatgcg cgagcggaat ggggagtagt 210 0 

ggcaactggt gcfcaccagca gtggtccgat tctaaggctt attacagcgg gttgcaaagc 2160 

gctttagggt atcaaacaca agcgacaact caaaatggga gcagtggtgg gagcaatatc 2220 

acctacaatg tccaacaaat cacgctcact agcggtggtt tgctcaatca aattatcaca 228 0 

aaccttaaga gcgttaatgg gggcagtaat gggggaagca gtgggaatgg cactagtcaa 234 0 

atcaacacag cctaccaaat gctcacagac gctagcgatg ggaaattagg gacttataat 240 0 

agtagcaata gtagcaatag tagcaatagt ggcaataata acggctatac gccatgcaat 2460 

agcaccaacg ggagcaatgg gacgagtggg agcaattgtt atgaacccaa caaacaacaa 2520 

aacgccacca ccgcaaccac cacgaccgac agcaatttac aaaaagtcta taatgacgcc 258 0 

caaaaaatag ccaatattat cgccagctct gggaacaata aaggcgttga aaacggctta 2640 

aaacaattct ttgaagcgtt aaaaagtaat agcagcagtc ttagtaattt atgtggtaat 270 0 

ggtagtagcg gtagtagctc tacttgctcc ggtgggctta tcaacctttt aggggcaatc 2760 

cccacaaacg gagtgagcga tacgaataat ttaattaatc tgctcactga attcattaaa 282 0 

accgccgggt ttatccaaaa taaggatagt aatgtatcta ctagtcttac aagcgctttt 288 0 

caagccatta cgagcgctat ttctcaaggg tttcaagcct tgcaaaacga tattagccct 2940 

aatgcgattt tgaccttgct ccaagaaatc acttctaaca ccaccaccat tcagtcattc 30 0 0 

tcgcaaacct tacggcagct tttaggggat aaaaccttct ttatggtgca acaaaagctc 3060 

attgatgcga tgattaacgc cagaaatcag gttcaaaacg cgcaaaatca agccaataac 312 0 

tacggctctc aacccgtttt aagccagtat gcggccgcta aaagcaccca acacggcatg 318 0 

agcaatggct taggggttgg cataggctat aaatacttct ttggtaaggc taggaaatta 324 0 

ggccttaggc attatttttt ctttgattac ggctttagtg aaataggcct agccaatcaa 33 0 0 

agcgtgaaag cgaatatctt tgcttatggg gtaggcacgg attttttatg gaatctattc 33 60 

aggaggactt acaacactaa agcgttgaat tttgggctat ttgccggggt ccaactgggc 342 0 

ggtgcaactt ggcttagttc cttaaggcaa caaatcattg acaactgggg gaacgctaat 348 0 

gacatccatt caacgaattt tcaagtggcg ctgaattttg gggtgcgcac caatttcgcg 3540 

gagtttaagc gttttgctaa gaaattccac aatcaagggg tcatcagcca aaagagcgtg 360 0 

gaatttggga tcaaggtgcc tctcatcaat caagcgtatt tgaatagtgc tggggctgat 3660 

gtgagctaca ggaggcttta tactttctat atcaattaca tcatggggtt ttaa 3714 

<212> Type : DNA 
<211> Length : 3714 

SequenceName : SEQ ID 479 

SequenceDescription : 



Sequence 
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<213> OrganismName : Helicobacter pylori, si 
<40 0> PreSequenceString : 

atgaaacaaa atttaaagcc attcaaaatg attaaggaaa 
aaagtaagat tcttagcccc tttgagccta gcgttaagct 
5 gctgaagaag atgggggctt tatgaccttt gggtatgaat 
gtgaaaaacc cgggtaaaat caaagccgaa gaattagcgg 
acaaacaaca ccaatatcaa tattgcaggc acaggaggga 
aaccttttta tgaaccaatt gggcaatttg attgatttgt 
aatcttcacc aatgcggtag cactaatagc ggtaatggcg 

10 aacaatagcc cttgtttcca aggtaacctg gctctttata 
aaaactttga gtcaaaacat cagcaagaac atctttcaag 
gctaatctct ccaaccagct cagtgagttg aacaccgcta 
aactcgttct taaacgccaa caaccaagcg ggtgggattt 

, . gcttacgaga atggtgttac cgctcaacaa atcgcttatg 

15 actatggggc caagcggtga tagtggggct gcgggagcgt 
caacatgttt tcaactcggc taacgctggg aacgatttga 
ttggtgcaaa acatcgtcaa taattctcaa aacgctttaa 
atcagcaatt caacaggcta tcaagtgagc tatggtggga 
acccaactgt taaacaacac cacaaacact ttggctaaag 

20 cttaaagcta acccatggct tgggaatttc gctgctggta 
tttaacgggt ttatcactaa aafccggttat aagcaattct 
ggcttacgct actacgggtt cttcagctat aacggcgcgg 
tacaatcaag tcaatctgct cacttatggg gtggggactg 
agccgctctt ttggcagtag gagtcttaat gcgggcttct 

25 ggggacactt acatcagcac gctaagaaac agccctcagc 
acaaaattcc aattcttgtt tgatgtgggc ttacgcatga 
gacctaaaaa gccataacca gcattctata gaaatcggtg 
aacacttact ataaagctgg tggcgctgaa gtgaaatact 
tgggtctatg gctacgcctt ctaa 

30 <212> Type : DNA 

<211> Length : 1584 

SequenceName : SSQ ID 480 
, SequenceDescription r 

35 Sequence 



<213> OrganismName : Helicobacter pylori, strain 0*99 
<4 0 0 > PreSequenceString : 

atgaaaaaaa cccttttact ctctctctct ctctcgtttg ggctccacgc tgaagacgac 60 

40 ggcttttacg caagcgcggg aattcggatc ggtgaagccg ctcaaatggt gaaaaacacc 12 0 

aaaggcattc aacagctttc agagaattat gaaaagttga acaatctttt aaataattac 18 0 

aacaccctaa acactcttgt aaagctgtcc tccgatccga gtgctgtcaa cgacgcaagg 240 

gataatctag gctcaagcac taggaatttg ctagatgtca aagccaattc ccccgcctat 3 00 

caagcggtgc ttttagcatt gaacgctgca gtgggcttgt ggcaagttac aagctatgcc 360 

45 tttaccgctt gtggtcctgg tagcaatgag aacgcgaatg gaggtatcca aacctttaat 420 

aatgtgccag gacaaaacac gacgaccatc acttgtaatt cgtattatga gccaggacat 480 

ggcgggccaa tatccactaa aaattatgcg atcatcaaca aggcttatca aatcattcaa 540 

aaggctttga cagccaatgg agaagggatc ccagttttaa gcaacaccac tacaaaactt 600 

gatttcacta tcaatggaga caaaagaacg ggtggcgaac caaataaaaa attagtatac 660 

50 ccatggagtc atgggaaagc tatttcaacc tcgtggaatg caaccataac agcaccaaca 720 

acagaaaata tcaatacaac caatagcgct caagagcttt taaaacaagc gagcatcatt 780 

atcactaccc tgaatagtgc atgcccaaac ttccaaaatg gtggtagcgg ttattgggca 840 

gggataagtg gcaatgggac aatgtgtggg atgtttaaga atgaaatcag cgctatccaa 900 

ggcatgatcg ctaacgcgca agaagctgtc gcgcaagcca aaatcgttag tgaaaacacg 960 

55 caaaatcaaa acagcctaga cgctggaaaa ccattcaacc cctacacaga cgctagtttt 102 0 

gctgaaagca tgctcaaaaa cgcgcaagcc caagcggaga ttttaaacca agccgaacaa 1080 

gtggtgaaaa actttgaaaa aatccctaca gcctttgtaa atgactcttt aggggtgtgt 114 0 

tatgaagtgc aaggaggtga gcgtcgtggc accaatccgg gtcagacgac ttctaacact 1200 

tggggggcag gctgtgcgta tgtaggacaa acgataacaa atcttaaaaa cagcatcgcc 12 60 

60 cattttggca ctcaagagca gcagatacag caagccgaaa acatcgctga cactctggtg 1320 

aatttcaaat ctagatacag cgaattgggc aacacttata acagcatcac cactgcgctc 1380 

tctaatatcc ctaacgcgca aagcttgcaa aatgcggtga gtaaaaagaa taacccctat 1440 

agcccgcaag gcatagacac caattactac ctcaatcaaa actcttacaa ccaaatccaa 1500 

accatcaacc aagaactcgg gcgtaacccc tttaggaaag tggggattgt tagttctcaa 1560 

65 accaataatg gcgcgatgaa tgggatcggt attcaggtgg gttacaaaca attctttggc 162 0 

caaaaaagaa aatggggcgc taggtattac ggcttttttg attacaacca tgcgttcatt 168 0 

aaatccagct tcttcaactc ggcttctgac gtgtggactt atggctttgg agcggacgct 1740 



:rain J99 

atttaatgac acaatctcaa 60 

tgagcttcaa tccagtgggc 12 0 

taggtcaggt ggtccagcaa 180 

gcctgttaaa ctctaccacg 240 

atgtcgccgg gactttgggc 3 00 

atcctacttt gaaaactaat 360 

ctactgctgc cgctgctact 420 

acgaaatggt tgactctatc 480 

gcgacaacaa caccacgagc 540 

gcgtttattt . gacttacatg 600 

ttcaaaacaa caccaatcaa 660 

tcctaaagca agcttcaatc 720 

ttttagacgc cgctttagcc 780 

gcgctaagga attcactagc 84 0 

cgctagccaa caacgctaac 900 

atattgatca agcgcgctct 960 

ttaccgctct aaacaacgag 102 0 

acagctctca agtgaatgcg 10 8 0 

tcggggaaaa caagaatgtg 1140 

gcgtgggtaa tggccccact 12 00 

atgtgcttta caatgtgttt 1260 

ttggggggat ccaactcgca 1320 

ttgcgagcag acctacagcg 1380 

actttggtat cttgaaaaaa 1440 

tgcaaatccc tacgatttac 1500 

tccgccctta tagcgtgtat 1560 

1584 
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ctttataatt tcatcaacga taaagccacc aatttcttag gcaaaaacaa caagctttct 18 00 

9tggggcttt ttgggggtat tgcattagcc gggacttcat ggcttaattc tgagtatgtg 1860 

aatttagcca ccatgaataa cgtctataac gctaaaatga acgtggcgaa tttccaattc 1920 

ttattcaaca tgggagtgag gatgaattta gccaggccta agaaaaaaga cagcgatcat 1980 

5 gcggctcagc atgggattga gttagggctt aaaatcccca ccatcaacac gaactactac 2040 

tcctttatgg gggctgaact caaataccga aggctctata gcgtgtattt gaattacgtg 2100 

ttcgcttact aa 2112 
<212> Type : DNA 
<211> Length : 2112 
10 SequenceName : SEQ ID 481 
SequenceDe script ion : 

Sequence 

15 <213> OrganismName : Helicobacter pylori, strain J99 
<40 0> PreSequenceString : 

atgataaaga agaatagaac gctgtttctt agtctagccc tttgcgctag cataagttat 60 

gccgaagatg atggagggtt tttcaccgtc ggttatcagc ttgggcaggt catgcaagat 120 

gtccaaaacc caggcggcgc taaaagcgac gaactcgcca gagagcttaa cgctgatgta 18 0 

20 acgaacaaca ttttaaacaa caacaccgga ggcaatgtcg caggggcgtt gagtaacgct 240 

ttctcccaat acctttattc gcttttaggg gcgtatccca cgaaactcaa tggtaacgac 3 00 

gtgtctgcga acgctctttt aagtggtgcg gtaggcagtg ggacttgcgc ggctgcaggg 360 

acggctggtg gcacaactct taacactcaa agcgcttgca ccgctgcggg ctattactgg 42 0 

ctccctagct tgactgatag gattttaagc acgatcggca gccagactaa ctacggcacg 480 

25 aacaccaatt tccccaacat gcaacaacag ctcacctact tgaatgcggg gaatgtgttt 540 

tttaatgcga tgaataaggc tttagagaag aatgggactg ctactgctaa tagcactagt 60 0 

agcactagcg gtgcgactgg ttcagatggt caaacttact ctcaacaagc tattcaatac 660 

cttcaaggcc aacaaaatat cttaaataac gcagcgaact tgctcaagca agatgaattg 720 

ctcctagaag ctttcaactc tgccgtagct gctaacattg ggaataagga attcaattca 780 

30 gccgctttta caggtttggt gcaaggcatt attgatcaat ctcaattggt ttataacgag 840 

ctcactaaaa acaccattag cgggagcgcg gttaataacg ctgggataaa ctccaaccaa 900 

gctaacgctg tgcaagggcg tgctagtcag ctccctaacg ctctttataa cgtgcaagta 960 

actttggata aaatcaacgc gctcaacaat caggtgagaa gcatgcctta cttgccccaa 1020 

ttcagagccg ggaacagccg tgcaacgaat attttaaacg ggttttacac taaagtgggc 108 0 

35 tataagcaat tcttcgggaa gaaaaggaat atcggtttgc gctattatgg tttcttttct 1140 

tataacggag cgagcgtggg ctttagatcc actcaaaata atgtagggtt atacacttat 1200 

ggggtgggga ctgatgtgtt gtataacatc tttagccgct cctatcaaaa ccgctctgtg 1260 

gatatgggct tttttagcgg tatccaatta gccggtgaga ccttccaatc cacgctcaga 1320 

gatgacccca atgtgaaatt gcatgggaaa atcaataaca cgcacttcca gttcctcttt 13 8 0 

.40 gacttcggta tgagaatgaa cttcggtaag ttggacggga aatccaaccg ccacaaccag 1440 

cacacggtgg aatttggcgt agtagtgcct acgatttata acacttatta caaatcagca 150 0 

gggactaccg tgaagtattt ccgtccttat agcgtttatt ggtcttatgg gtattcattc 1560 

taa 1563 
<212> Type : DNA 

45 <211> Length : 1563 

SequenceName : SEQ ID 482 
SequenceDescription : 

Sequence 
50 

<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaagaaaa aatttctgtc attaacctta ggttcgcttt tagtttccgc tttaagcgct 60 

gaagacaacg gcttttttgt gagcgccggc tatcaaatcg gtgaatccgc tcaaatggtg 120 

55 aaaaacacca aaggcattca agatctttca gacagctatg aaagattgaa caacctttta 180 

acgaattata gcgtcctaaa cgctctcatc aggcagtccg ccgaccccaa cgccatcaat 240 

aacgcaaggg gcaatttgaa cgcgagcgcg aagaatttga tcaatgataa aaagaatt cc 300 

ccggcgtatc aagccgtgct tttagccttg aatgcggcag cggggttgtg gcaagtcatg 3 60 

agctatgcga tcagcccttg tggtcccggt aaagacacaa gcaaaaatgg gggcgttcaa 420 

60 actttccaca acacgccttc aaatcaatgg ggaggcacta ccattacttg tggcactact 480 

ggttatgaac caggaccata cagcatttta tccactgaaa attacgcgaa aatcaataaa 540 

gcttatcaaa tcatccaaaa ggcttttggg agcagcggaa aagatattcc tgccttaagc 600 

gacaccaaca cagaactcaa attcacaatc aataaaaata atggaaacac gaatacgaat 660 

aataatggag aagaaattgt tacaaaaaat aacgctcaag ttcttttaga acaggctagc 720 

65 accattataa ctacccttaa tagcgcatgc ccatggatca acaatggtgg tgcaggtggt 780 

gcgagtagtg gtagtttatg ggaaggaata tatttgaaag gcgatgggag cgcttgcggg 840 

atttttaaaa atgaaatcag cgcgattcaa gacatgatca aaaacgctgc aatagccgta 900 
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gagcaatcca agatcgttgc tgcaaacgcg caaaaccagc gcaacctaga caccgggaag 960 

acattcaacc cctataaaga cgccaacttc gcccaaagca tgttcgctaa cgccaaagcg 1020 

caagcggaga ttttaaaccg cgcccaagca gtggtgaaag actttgaaag aatccctgca 108 0 

gagttcgtaa aagactcttt aggggtgtgc catgaagtgc aaaacggcca tctccgtggc 1140 

5 acgccatccg gcacggtaac tgataacact tggggagccg gttgcgcgta tgtgggagag 12 0 0 

accgtaacga atctaaaaga cagcatcgct cattttggcg accaagccga gcgaatccat 1260 

aacgcgcgca acctcgccta cactttagcg aacttcagca gtcagtatca aaaactaggc 13 2 0 

gaacactatg acagcatcac agcggccatt tcaagcttgc ctgatgcgca atctttacaa 13 80 

aatgtggtga gcaaaaagac taaccctaat agcccacaag gcatacagga taactactat 1440 

10 attgactcca atatccattc tcaagtgcaa tctaggagtc aagaactcgg cagtaaccct 1500 

ttcaggcgtg ctggcttaat cgccgcttct accaccaata acggcgcgat gaacgggata 1560 

ggctttcaag tgggctataa gcaattcttt gggaaaaaca aacgatgggg cgcaaggtat 1620 

tacggctttg tggattacaa ccacacctat aacaaatccc aatttttcaa cgcctcttct 1680 

gatgtctgga cctatggcgt ggggagcgat ttgttagtga atttcatcaa cgataaagcc 1740 

15 actaaacaca ataagatttc ttttggcgcg tttggcggta tcgccttagc cgggacttca u 1800 

tggcttaatt ctcagtatgt gaatttagcg aacgtgaata attattataa ggccaaaatc 1860 

aacacggcga atttccaatt cttattcaat ctgggtttga gaatgaacct cgctaggaaa 192 0 

aagcatagag cgaccgataa cgcggcccaa catggcattg aactaggcac aaagatcccc 1980 

acgatcaaca cgaattacta ttctttgcta ggcactacct tgcaatacag aaggctttat 2040 

20 agcgtgtatc tcaactatgt gttcgcttac taa 2073 
<212> Type : DNA 
<211> Length : 2073 

SequenceName : SEQ ID 483 
SequenceDe script ion : 



25 



Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<4 00> PreSequenceString : 
30 atgaaaatca aaaaatccct ctttgctctc tctttctctc tcatggcttc attatcaagg 60 
gctgaagatg acggatttta catgagtgtg ggctatcaaa tcggtgaagc ggtccaaaaa 120 
gtgaaaaaca ctggagcatt acaaaatctt gcagacagat acgataactt gagcaacctt 180 
ttaaaccaat acaattactt aaattcctta gtcaatctag ccagcacgcc tagcgcgatt 240 
accggtgcga ttgacaatct aagctcaagc gcgatcaatc tcactagcgc taccaccact 300 
35 tctccggcct atcaagctgt ggctttagcg ctcaatgcgg ctgtgggcat gtggcaagtc 360 
atagcctttg gcatcagctg tggccctggc cccaatcttg gcccagaaca tttagaaaat - 420' 
gggggcgttc gatcgtttga caacacgcca aactacagct acaacaccgg tagcggaacg 480 
accaccacca cttgtaatgg agccagtaat gtagggccca atggtatcct atctagcagc 540 
gaataccagg ttctcaatac cgcttatcaa actatccaaa ccgctttaaa ccaaaaccaa 600 
40 ggaggcggga tgcctgcctt gaatagctcc aaaaatatgg tagtcaatat caatcaaact 660 
ttcacaaaaa accctaccac agaatacact taccccgatg ggaatggcaa ttattattca 720 
ggcggttcat caatcccaat ccagctaaag attagtagcg tcaatgacgc tgaaaacctt 780 
ttgcaacaag ccgctactat catcaatgtc cttaccaccc aaaacccgca tgtgaatggt 840 
ggcggtgggg catgggggtt tggcggtaag accgggaatg tgatggatat ttttggcgat 900 
45 agttttaacg ctattaacga aatgatcaaa aacgctcaag ccgttttaga aaaaacccaa 960 

cagcttaacg ctaatgaaaa cacccaaatc acgcaaccag acaatttcaa cccctacact 1020 

tctaaagaca cgcagttcgc tcaagaaatg ctcaatagag ctaacgctca agcagagatt 10 80 

ttgagcttag cccaacaagt agcggacaat ttccacagca ttcaagggcc tatccaacaa 1140 

gatctagaag aatgcaccgc aggatcagct ggtgtgatta acgacaacac ttatggttca 12 00 

50 ggttgcgcgt ttgtgaaaga gactctcaat tccttagagc aacacaccgc ttattatggc 1260 

aaccaggtca atcaggatag ggctttgtct caaaccattt tgaattttaa agaagccctt 13 20 

agcactttag ggaacgactc aaaagcgatc aatagcggta tctctaactt gcctaacgct 13 80 

aagtcccttc aaaacatgac gcatgccact caaaacccta attccccaga aggtttgctc 1440 

acttattctt tggataccag caaatacaac cagctccaaa ctgttgcgca agaattaggc 1500 

55 aaaaacccct ttaggcgcat cggcgtgatt aactatcaaa acaataacgg ggcgatgaac 1560 

ggcatcggcg tgcaagcggg ctataagcaa ttctttggca aaaaaaggaa ttgggggtta 1620 

aggtattatg gtttctttga ttataaccat gcttatatca aatctaattt ttttaactcg 1680 

gcttctgatg tgtggactta tggggtgggt atggacgcgc tttataactt catcaacgat 1740 

aaaaacacca actttttagg caaaaataac aagctttctg tggggctttt tggtggcttt 1800 

60 gcgttagccg ggacttcgtg gcttaattcc caacaagtga atttgaccat gatgaatggc 1860 

atttataacg ctaatgtcag cgcttctaac ttccaatttt tgtttgattt aggcttgaga 1920 

atgaacctcg ctaggcccaa gaaaaaagac agcgatcatg ccgctcagca tggcatggaa 1980 

ttgggcgtga aaatccccac cattaacacg gattattatt ctttcatggg ggctgaactc 2040 

aaatacagaa ggctctatag cgtgtatctc aattatgtgt ttgcttacta a 2091 



65 



<212> Type : DNA 
<211> Length : 2091 
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SequenceName : SEQ ID 484 
SequenceDescription : 



Sequence 
5 

<213> Organi smName : Helicobacter pylori, strain J99 
<400 > PreSequenceString : 

atgaaaatca aaaaatccct ctttgctctc tctttctctc tcatggcttc attatcaagg 60 

gctgaagatg acggatttta catgagtgtg ggctatcaaa tcggtgaagc ggtccaaaaa 12 0 

10 gtgaaaaaca ctggagcatt acaaaatctt gcagacagat acgataactt gagcaacctt 180 

ttaaaccaat acaattactt aaattcctta gtcaatctag ccagcacgcc tagcgcgatt 240 

accggtgcga ttgacaatct aagctcaagc gcgatcaatc tcactagcgc taccaccact 3 00 

tctccggcct atcaagctgt ggctttagcg ctcaatgcgg ctgtgggcat gtggcaagtc 3 60 

- atagcctttg gcatcagctg tggccctggc cccaatcttg gcccagaaca* tttagaaaat 420 

15 gggggcgttc gatcgtttga caacacgcca aactacagct acaacaccgg tagcggaacg 480 

accaccacca cttgtaatgg agccagtaat gtagggccca atggtatcct atctagcagc 540 

gaataccagg ttctcaatac cgcttatcaa actatccaaa ccgctttaaa ccaaaaccaa 600 

99 a gg c 9gga- tgcctgcctt gaatagctcc aaaaatatgg tagtcaatat caatcaaact 660 

ttcacaaaaa accctaccac agaatacact taccccgatg ggaatggcaa ttattattca 720 

20 ggcggttcat caatcccaat ccagctaaag attagtagcg tcaatgacgc tgaaaacctt 780 

ttgcaacaag ccgctactat catcaatgtc cttaccaccc aaaacccgca tgtgaatggt 840 

ggcggtgggg catgggggtt tggcggtaag accgggaatg tgatggatat ttttggcgat 90 0 

agtttfcaacg ctattaacga aatgatcaaa aacgctcaag ccgttttaga aaaaacccaa 960 

cagcttaacg ctaatgaaaa cacccaaatc acgcaaccag acaatttcaa cccctacact 1020 

25 tctaaagaca cgcagttcgc tcaagaaatg ctcaatagag ctaacgctca agcagagatt 1080 

ttgagcttag cccaacaagt agcggacaat ttccacagca ttcaagggcc tatccaacaa 1140 

gatctagaag aatgcaccgc aggatcagct ggtgtgatta acgacaacac ttatggttca 12 00 

ggttgcgcgt ttgtgaaaga gactctcaat tccttagagc aacacaccgc ttattatggc 1260 

aaccaggtca atcaggatag ggctttgtct caaaccattt tgaattttaa agaagccctt 1320 

30 agcactttag ggaacgactc aaaagcgatc aatagcggta tctctaactt gcctaacgct 13 80 

aagtcccttc aaaacatgac gcatgccact caaaacccta attccccaga aggtttgctc 1440 

acttattctt tggataccag caaatacaac cagctccaaa ctgttgcgca agaattaggc 1500 

aaaaacccct ttaggcgcat cggcgtgatt aactatcaaa acaataacgg ggcgatgaac 1560 

ggcatcggcg tgcaagcggg ctataagcaa ttctttggca aaaaaaggaa ttgggggtta 1620 

35 aggtattatg gtttctttga ttataaccat gcttatatca aatctaattt ttttaactcg 1680 

gcttctgatg tgtggactta 1 tggggtgggt atggacgcgc tttataactt catcaacgat 1740 

aaaaacacca actttttagg caaaaataac aagctttctg tggggctttt tggtggcttt 1800 

gcgttagccg ggacttcgtg gcttaattcc caacaagtga atttgaccat gatgaatggc 1860 

atttataacg ctaatgtcag cgcttctaac ttccaatttt tgtttgattt aggcttgaga 1920 

40 atgaacctcg ctaggcccaa gaaaaaagac agcgatcatg ccgctcagca tggcatggaa 1980 

ttgggcgtga aaatccccac cattaacacg gattattatt ctttcatggg ggctgaactc 2040 

aaatacagaa ggctctatag cgtgtatctc aattatgtgt ttgcttacta g 2091 

<212> Type : DNA 
45 <211> Length : 2091 

SequenceName : SEQ ID 485 
SequenceDescription : 

Sequence 
50 

<213> Organi smName : Helicobacter pylori , strain J99 
<4 00> PreSequenceString : 

ttgcataaaa aagttctgtt ggctttaact gccagcttga tttgccaaga gtctttgttc 60 

gctaaggata aagactacac tttgggcaag gtttctactg ccggtaaaaa ggatagatct 120 

55 gactattctg ggcaggtcaa tttgggttat agcgggatta ccgcgcctaa gagttggcaa 180 

gatgaagaag tgaaaaaata cacaggaagc cgcacggtga tctctaacaa agcgctcacc 240 

caacaagcta accaaagcat tgaagaagct ttacagaatg tccccggtct gcaaattagg 3 00 

aatgccacag gtgtgggggc tatgcctact atccaaatcc gtggctttgg agcggggggt 360 

tcagggcata gcgatgcgac gctcatgtta gttaatggta ttcctgttta tatggcccct 420 

60 tacgctcaca ttgagctaga cattttccct gttacctttc aagccattga tcgcattgat 480 

gtgatcaaag gtggaggcag cgtgcaatat gggcctaaca cttatggggg tattgtcaat 540 

atcatcacta aacctatccc taatcaatgg gaaaaccaag cggctgaaag gatcacttat 600 

tgggctaagg ctagaaacgc tgggtttgcc gctccccctg ataaaaccgg cgatccttct 660 

ttcatcaagt ctttaggcaa caacctcctc tataacactt atgtgaggag cggagggatg 720 

65 atcaataagc atgtgggtat ccaagcgcaa gctaactggg ttagaggcca aggctttagg 780 

gacaatagcc cctctagtat ttcaaactat tggctggatg gggtctatga catcaatgaa 840 

agcaatggga ttaaagccta ttaccaatac tacgattttg ctatcgccca accgggatca 900 
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ctcagcgagc 
ggcgggcgct 
gtgggcggga 
agctctagct 
5 gcagggcttc 
tgggcggagg 
ctgatcgtca 
accacttttt 
gagggggcag 

10 aagcatggcg 
agcgatcgca 
tttgtgcaat 
agaaaaatca 
..caaggpgatc 

15 ttagacgttt 
gcaggagcgc 
tgggcgcgcg 
aatgtgcgcc 
attagagggt 

20 ggccctttaa 
ccttttgtaa 
attggtattt 
ggaggctatt 
agcggttatc 

25 tgggtgtgga 
ggaagcttac 
agccctgcag 
ttctaa 



aagattacaa 
cacaacgctt 
ctttcagctt 
acaatagcgc 
cggcagggta 
tagaaaaccc 
ataccgggaa 
tacaacgcca 
gcttcttgtg 
tttatagaaa 
ttgaagcttg 
acaacaacga 
agcacatgaa 
acaatgtgct 
tgagctatgg 
gctacaccta 
attttgccac 
ccattaatgg 
tgcaattcca 
ccgacttgaa 
gccctttcca 
ctagctattt 
atgggatgca 
aatgcgaagc 
atatccaagt 
aaatcaataa 
gcttgcaacc 



aataaaccgc 
tggggctgtg 
cacttactac 
taacatggtt 
taacttggct 
tgtgcgttcc 
agtcaggcaa 
atacttaaac 

tgagggccct 

ctggaataac 
ggacgggcgc 
aaatgcgtct 
caattggatg 
tacctacttc 
aggagcggag 
taaagataaa 
cgggcagtat 
ctattctcaa 
tgccgctttc 
cggggatgtg 
attcattttt 
ttatagccgt 
atactatagt 
ttggtgtatg 
gagccaaatt 
catcttcaac 
tgcgcctgga 



ttcgctaatt 
tatgaaaacc 
gggcagttga 
acttgtttta 
gtgccttatt 
attaacaacg 
acctttatca 
accaatgaat 
aacgtgatga 
tggcgcaaca 
tttttcatcg 
aactggatgc 
ccctcaacca 
aactaccaac 
tattttaccc 
ttcagcttca 
tcagtctata 
ggcgtggagc 
aactacattg 
ctaaaaggga 
gacgctcgct 
gcttatagcg 
ggggggaaca 
acccaacatg 
ttctgggaaa 
atgaagtatt 
agatcggtta 



tgcgcccctt 
gcttcgggga 
tgactaggga 
gcgaagcggc 
atgccactaa 
cttttgagcc 
tgggtttgcg 
gcgccactaa 
gcggttggaa 
attacacagc 
tgcctggttt 
aaatccctga 
acattggctt 
gctctttcgt 
aacactttga 
atgcggacta 
cgagcgggcc 
tggaattgta 
acactcgtgt 
ctagctataa 
acaattggcg 
ggattagcaa 
actatgaaag 
aagggctctt 
acggaagaca 
attttacagg 
cagcgtattt 



aaaccaaaaa 
tttagacaga 
ctttcaggtg 
atgcagggcg 
ctacaatggt 
taaagtgaat 
tttcatgacc, 
aacgagcggt 
accccacatc 
ggtctatttg 
gcgctacgct 
gaaggattta 
tatccctgtg 
cccgcctcaa 
cacggtggaa 
ctttaggatt 
catgaaaggt 
ttacaggccc 
aactagccat 
caagcatttc 
taaaaccacc 
cagcgcagca 
cgttcttaat 
gccttggtat 
cagagttaca 
gattggctct 
gaactacact 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
162 0 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2526 



<212> Type : DNA 
30 <211> Length : 2526 

SequenceNatne : SEQ ID 486 
SequenceDescription z 



35 



40 



45 



50 



55 



60 



65 



Sequence . 

<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaaaaaaa cccttttact ctctctctcc gcttcatcgc ttttaaacgc tgaagacaac 60 

ggctttttta tcagcgcggg ctatcaaatc ggtgaagccg ctcaaatggt gaaaaacacc 120 

ggcgaattga aaaaactttc agacacttat gagaatttga gcaacctttt aaccaatttt 180 

aacaacctca atcaggcggt aacgaacgcg agcagccctt cagaaatcaa tgctgcgatc 240 

gataatttaa aagcaaacac gcaagggcta attggcgaaa aaaccaattc cccggcgtat 300 

caagcggtgt atttggcgct caatgcggcg gtagggctgt ggaatgtcat cgcctataat 3 60 

gtccaatgcg gtcctggtaa cagtggacaa caaagcgtaa cctttgaggg ccaaccagga 420 

cataattcaa gttccattaa ttgcaattta accggttata acaacggggt tagcggccct 4 80 

ttatccattg agaattttaa aaagcttaat caggcttatc aaactatcca acaagcttta 540 

aaacaagata gcggatttcc tgttttggat agtgcaggaa aacaagtaac tataacaata 600 

acaacgcaaa ctaatggagc taataaaagt gaaactacta ctactactac tactactaat 660 

gacgctcaaa cccttttgca agaagccagt aaaatgataa gcgtcctcac tacaaactgc 720 

ccatgggtca atcacaatca aggacaaaac gggggcgcgc cgtggggttt agatacggca 780 

gggaatgtgt gtcaggtttt tgccacggaa tttagcgccg ttactagcat gatcaaaaac 840 

gcccaagaaa tcgtaacgca agctcaaagc cttaaccagc aaaacaatca aaacgcgccg 9 00 

caagatttca atccttacac ctctgctgat agggctttcg ctcaaaacat gctcaatcac 960 

gcgcaagcgc aagccaagat acttgagcta gccgatcaaa tgaaaaaaga ccttaacact 1020 

atcccaagcc aatttatcac aaattacttg gcagcttgcc acaatggggg tgggacatta 10 80 

cctgatgcgg gggttactaa caacacttgg ggggccggtt gcgcgtatgt ggaagagacg 1140 

ataacggctt taaacaacag ccttgcgcat tttggcactc aagctgagca aatcaagcaa 12 00 

tctgagttgt tggcgcgcac catacttgat tttagaggca gccttagtaa tttaaacaac 12 60 

acttataaca gcatcaccac gaccgcttca aacacgccta attccccatt ccttaaaaat 1320 

ttgataagcc aatccactaa ccctaataac cccgggggct tacaggccgt ttatcaagtc 13 80 

aaccaaagcg cttattcgca attattaagc gccacgcaag aattagggca taaccctttc 144 0 

agacgcgttg gattaatcag ctctcaaacc aacaatggtg ccatgaatgg gatcggtgtg 15 00 

caagtgggct acaaacaatt ttttggtgaa aagagaaggt gggggttaag gtattacggc 1560 

ttttttgact ataaccatgc ttatatcaaa tctagctttt tcaattcggc ttctgatgtg 1620 

ttcacttatg gggtagggac agatgtcctc tataacttta tcaatgataa aaccaccaaa 1680 

aacagcaaga tttcttttgg ggtgtttggg gggattgcgt tagctggcac ttcatggctg 1740 

aattcccagt atgtgaattt agcgaccttc aataatttct atagcgctaa aatgaatgtg 18 0 0 
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gcgaatttcc aattcttgtt caatttaggc ttgagaatga acctcgctaa gaataagaaa 1860 
aaagcgagcg atcatgcggc tcagcatggc gtggaattag gcgtgaagat ccccacgatc 192 0 
aacacgaatt actattcttt gctaggcact caactccaat accgaagatt gtatagcgtg 198 0 
tatttgaatt atgtgttcgc ttactaa 2007 
<212> Type : DNA 
<211> Length : 2007 

SequenceName : SEQ ID 487 

SequenceDescription : 



10 Sequence 



<213> OrganisrnName : Helicobacter pylori, strain J99 
<40 0> PreSequenceString : 

atgagaaaac tattcatccc acttttatta ttcagcgctt tagaagcgaa. cgagaaaaac 60 

15 ggctttttca tagaagccgg ctttgaaact gggctattag aaggcacaca aacgcaagaa 120 

aaaagacaca ccaccacaaa aaacacttac gcaacttaca attatttacc cacagacacg 180 

attttaaaaa gagcggctaa tttattcacc aatgccgaag cgatttcaaa attaaaattc 240 

tcatctttat cccctgttag agtgttgtat atgtataatg gtcaattaac tatagaaaac 3 00 

ttcttgcctt ataatttaaa taatgttaag cttagtttta cagacgctca aggcaacacg 3 60 

20 attgatctag gcgtgataga gaccatcccc aaacactcta agattgtttt acccggggag 420 

gcgtttgata gtttaaaaga ggcgtttgat aaaattgacc cctatacttt atttcttcca 480 

aaatttgaag ccactagcac ttctatttct gatactaaca cgcagagggt gtttgaaacg 540 

ctcaataaca ttaaaacaaa tcttataatg aaatatagta atgaaaatcc aaacaatttc 600 

aacacttgtc cttacaataa taatggtaat acaaaaaatg attgttggca aaatttcacc 660 

25 ccacaaaccg cagaagaatt caccaattta atgttgaaca tgatcgctgt cttagactcc 720 

caatcttggg gcgatgcgat cttaaacgct ccttttgaat tcactaacag ctcaacagat 780 

tgcgatagcg atccttcaaa atgcgtaaat cccggagtaa atgggcgtgt tgatactaaa 840 

gtcgatcaac aatatatact caacaaacaa ggtattatta ataattttag aaaaaaaata 900 

gaaattgatg cggttgtttt aaaaaattca ggggttgtag ggttagccaa tggatatggc 960 

30 aatgatggtg aatatggcac attaggggta gaagcctatg ctttagatcc taaaaaactc 1020 

tttggcaacg accttaagac tatcaattta gaagatttaa gaaccatctt gcatgaattc 1080 

agccacacta aaggctatgg gcataacggg aatatgacct atcaaagagt gccggtaacg 1140 

aaagatggtc aagtggaaaa ggatagtaat ggcaagccaa aagattctga tggcctcccc 12 00 

tataatgtgt gttcgcttta tgggggatcc aatcagcccg ctttccctag caactaccct 1260 

35 aattccatct atcacaattg tgcggatgtc ccggctggct ttttaggggt aacagcagcg 1320 

gtttggcagc agctcatcaa tcaaaacgcc ttgccgatca actacgctaa cttggggagt 13 80 

caaacaaact acaacctaaa cgctagttta aacacgcaag atttagccaa ttccatgctc 1440 

agcaccatcc aaaaaacctt tgtaacttct agcgttacca accaccattt ttcaaacgca 15 00 

tcgcaaagtt ttagaagccc tattttaggg gttaacgcta aaataggcta tcaaaactac 1560 

40 tttaatgatt tcatagggtt ggcttattat ggcatcatca aatacaatta cgctaaagct 1620 

gttaatcaaa aagtccagca attgagctat ggtgggggga tagatttgtt attggatttc 1680 

atcaccactt actccaataa aaatagccct acaggcattc aaaccaaaag gaatttttct 1740 

tcatcttttg gtatctttgg ggggttaagg ggcttgtata acagctatta tgtgttgaac 18 0 0 

aaagtcaaag gaagcggcaa tttagatgtg gctaccgggt tgaactaccg ctataagcat 1860 

45 tctaaatatt ctgtagggat tagcatccct ttaatccaaa gaaaagctag cgtcgtttct 1920 

agcggtggcg attatacgaa ctcttttgtt ttcaatgaag gggctagcca ctttaaggtg 19 80 

tttttcaatt acgggtgggt gttttag 2 0 07 
<212> Type : DNA 
<211> Length. : 2007 

50 SequenceName : SEQ ID 48 8 

SequenceDescription : 

Sequence 



55 <213> OrganisrnName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaataaaa caacaattaa aatattaatg ggcatggcgt tattatcatc gcttcaagcc 60 

gcagaggcag agcttgatga aaaatcaaaa aaacctaaat ttgcggatag gaatacgttt 12 0 

tatttagggg ttgggtatca gcttagcgcg atcaacacgt cttttagcac cagttctata 180 

60 gataaatcgt atttcatgac cggcaatggt tttggcgtgg tgttgggggg gaaatttgtg 240 

gctaaaacgc aagctgtaga gcatgtgggt tttcgttacg ggttgtttta tgatcagacc 3 00 

ttttcttctc acaaatccta tatttctacc tatggtttag aatttagcgg tttgtgggac 360 

gctttcaatt cgccaaagat gtttttgggg ttggagtttg gcttaggcat cgctggggcg 420 

acttacatgc caggaggggc catgcatggg attatcgctc aatatttagg caaagaaaat 480 

65 tcgcttttcc aattgcttgt gaaagtgggt tttcgttttg gctttttcca caatgaaatc 540 

acctttgggt tgaaattccc tgtcattcct aacaaaaaaa cggaaatcgt tgatggcttg 600 

agcgcgacca ctttatggca acgcttgccg gtagcctatt tcaattatat ctataatttt 660 



WO 2005/076010 



174/341 



PCT/IN2005/000037 



tag 

<212> Type : DNA 

<211> Length : 663 

SequenceName : SEQ ID 489 
SequenceDescription : 



663 



Sequence 

<213> OrganismName : Helicobacter pylori , strain J99 

10 <400> PreSequenceString : 

atgaaaaaaa cgaaaaaaac gattctgctt tctctaactc ttgcggcgtc 
gctgaagaca acggcgtttt tttaagcgtg ggctatcaaa tcggtgaagc 
gtgaaaaacg ccgacaaggt acaaaagctt tcagacgttt atgaacaatt 
t.tagccaacg ataatggcac tagctcaaaa acaagcgcgc aagcgatcaa. 

15 aataatttga atgaaagcgc aaaaacttta gccggtggga caaccaattc 
caagccacgc ttttagcatt gagatcggcg ttagggttat ggaatagcat 
gtcgtatgcg gaggttatat taaaaaaccg ggcgaaaaca atcaaaaaaa 
accgatgaga atggcaacgg cactacaatc aattgcggtg ggagcacaaa 
actcatagtc ctaatggcac aaatacatta aaagcagaca aaaatgtttc 

20 gagcaatatg aaaaaatcca tgaagcctat caaatccttt caaaggcttt 
gggcttgctc ctttaaatag caaaggggaa aagttagaag cgcatgtaac 
gatcaacaag gaacatccag tgaccaaact acaaccacaa cttctgttat 
aatgatgcgc aaaatctttt gactcaagcg caaacgattg tcaataccct 
tgccccatgt tgatagcgaa atctagtagt aatggtggaa ctaatggcgc 

25 tcatggcaaa cagccggtgg cggcaaaaat tcatgtgcga cttttggtgc 
gctatttcag acatgattag taacgctcaa aaaatcgttc aagaaaccca 
gccaaccaac ccaaaaatat cacccaaccc aataatttca accttaactc 
cttacggctt tagctcaaag catgctcaaa aacgctcaat ctcaaacaga 
ttagccaatc aggtagcaag cgattttgac aaactttctt caggctatct 

30 atagggaaat gcgatgtgag tggtgtgagt agttcaaata tgacaccgca 
accacttggg ggaaaggctg cgcgggcgtg gaagaaactc taacttcgtt 
accactgatt ttaacaacca gacaacgccc caactcgatc aagcgcaaac 
acccttactc aagaactcgg caataaccct ttcaaacgag tgggtatcat 
accaataacg gggcgafcgaa tggccttggg gtgcaagcgg gttataagca 

35 caaaaaagaa ggtgggggtt aaggtattac ggcttttttg actacaacca 
aaatccagct tttttaactc gtcttctgac gttttgactt atggggtggg 
ttgtttaatt tcatcaatga taaaaacacc aatttcttag gcaagaacaa 
gtggggcttt ttggaggtat cgccttagca gggacttcgt ggcttaattc 
aatttaaaaa ccatcagcaa tgtttatagc gctaaagtga atacggctaa 

40 ttattcaatt taggcttgag aaccaatctc gctaggccta agaaaaaaga 
tccgcgcaac atggcatgga attgggcgtg aaaatcccta ccattaacac 
tcttacttgg gaactaaact agaataccga agactctata gcgtgtatct 
tttgcgtatt ga 
<212> Type : DNA 

45 <211> Length : 1932 

SequenceName : SEQ ID 490 
SequenceDescription : 



attgctccat 
ggttcaaaag 
aagcaagctt 
tcaagcggtt 
ccctgcctat 
gggctatgcg 
tttccactac 
tagtaatggc 
tctatctatt 
aaaacaagct 
cacatcaaag 
tgatacgact 
taaagattat 
aaacacccct 
ggagtttagt 
acaacttaac 
tcctggcagt 
aattttaaaa 
taaagattac 
aaatatgaat 
aaaagcaagc 
cctagccaat 
tggctctcaa 
attctttggt 
tacctacatc 
tagcgattta 
taagatttct 
tcaattcgtg 
tttccaattc 
cagcgatcat 
gaattactat 
caattatgtg 



60 
120 
180 
240 
3 00 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1932 



50 



55 



60 



65 



Sequence 

<213> OrganismName : Helicobacter pylori, strain J99 
<40 0> PreSequenceString : 

atgaaaaaaa cgattttact ttctctcatg gtgtcatcgc tctttgctga aaatgacggc 60 

gtttatatga gcgtgggcta tcaaatcggc gaagccgcac aaatggtgaa aaacaccggc 120 

gaaatccaaa aagtctccaa cgcttacgaa aatttgaaca accttttaac ccgctataat 180 

gaactcaaac aaacggcctc taacactgat tcaagcaccg ctcaagcgat tgacaatcta 240 

gaaaagagcg ctagcagatt gaaaacgacc cctaataccg ccaatcaagc cgtgtcctca 3 00 

gcgctcagct ctgcggtggg catgtggcaa gtgatagcct ctaatttagc caacaactcg 360 

ctatcttcta gcgaatacga aaaactcaa'a gcgacttctc aattgctcca aaatacccta 420 

gaaaataaaa acaataatct taaaattgaa aatgactatg accagctttt aactcaagct 480 

agtaccatta ttaataccct tcaaagccaa tgcccaggcg tagatggggg caatggcaaa 540 

ccatggggca ttaatacaag cgggaacgca tgcgctattt ttggtagcac ctttaacgcc 600 

attaatagca tgattgatag cgctaaaaaa gccgccgcag atgcccgaag aactgcccca 660 

gaaagtccaa accaacaaaa cgcgtttacc aacgctgatt tcaataaaaa cctcaatcaa 720 

gtctcaagcg ttatcaatga caccatctct tacctcaaag gggacaattt agaaaccatc 780 

tacaacacca ttcaaaaaac gcctaattct aaagggtttc aaagtttggt gagccggtct 840 

agctatagtt attctctcaa cgaaacccaa tattctcaat tccaaactac caccaaagag 900 
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tttggtcata accccttcag aagcgtggga 
atgaatggcg tgggcgtgca actaggctat 
gggatccggt attatggctt ttttgattac 
aactccgctt ccaatgtttt cacctatggc 
5 aatggcggat ccgatcgaaa ccgcaaagtc 
gcgggaacga catggcttaa taaccaatct 
agcgctaaga tcaacaacac caatttccaa 
gggatccatc atggcattga attaggcgtg 
tctttcatgg gcgctaaatt agcctaccgc 
10 ttggcttatt ga 

<212> Type : DNA 
<211> Length : 1452 

SequenceName : SEQ ID 491 
.... ~' SequenceDescription : 

15 

Sequence 



ttaatcaact ctcaaagcaa taacggggcg 960 

aagcaattct ttgggaaaaa taaatttttt 1020 

aactatgcgt atatcaaatc caattttttc 1080 

gcgggcagtg atcttfctatt gaacttcatc 1140 

tcttttggca tttttggggg catcgctcta 12 00 

gcgaatttaa aaatcaccaa tagcgcctac 12 60 

tfccttgttca ataccggttt aaggcttcaa 1320 

aaaatcccta cgatcaacac caattactat 13 8 0 

aggctttata gcttgtacct caattatgtt 1440 

1452 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



<213> Organ isixiName : Helicobacter pylori, 
<400> PreSequenceString : 



strain J99 



atgccaaagg 
tttgaagcta 
aaaatcaata 
ttagggcttg 
atagggggca 
gfcggcttaca 
caaaccgata 
tttagcgacg 
tatctctcca 
tacaagcatg 
tttttaatgg 
aatggaggtt 
acctataaac 
aacgctccag 
cgctctgata 
tccaatggaa 
attatccgcc 
tttcaaaacg 
agtaacagcg 
ttcctttctt 
tactccaatt 
actgaatatt 



caagtcaggt 
agctcaacgg 
aagaaagagg 
gttttagcct 
tggtcggatc 
acctctttgg 
gccttgagac 
cgtatttaga 
ctatgcctta 
cgcgcttgac 



tttattcttt ggagcgtttt 
ctttgtggat caatccagca 
catctaccct atgcagcaat 
gttacccaaa aaggtttcag 
tattttctat gacggcacga 



gggtttatgg 
aaacacaaca 
aagaattatt 
caaacgcaag 
tcatggggta 
acttatagcg 



taagcacttc 
cgatcggttt 
tcgcaacgat 
accatgttct 
agaagtttga 
gggtctatac 
aaaatgtccg 
ttgaaataaa 
gctttcaagt 
gggcgtttgc 
gaggctttac 
gcacgcatgc 
acctttcgcc 



tctaacccta attttagcgg 
cccatttact acccttggat 



ttattacgat 
acagaacatg 
atacgcttat 
taaaagcggt 
ttggtttagc 
attggtttgc cgcacggacc 
atgatagcca tgggcgaaag gtgctttatg 
ctcatcgttt cctcatagaa ggcttttatt 
gcgttaagat tggttgggac 
cggctatcat agggtttttc 
gcccggtcta tagatacgac 
aacgctttga catcaacaat 
ctaatggttg gataggcaac 
tctatgcagg ttttacaggc 
gtggtggcac tcattttgcc 
cagtggtctc ttgggaagca agagcgatga 
tgagcggtag cgtggatctt gcgtattatg gcgtgcatac 
tttaaaccgg gtgaaaacgg gcctgtgcct aaaaacttcc ccgcccttta 
agcgctttat acacggctct agtagcgtct ttttga 
<212> Type : DNA 
<211> Length : 1416 

SequenceName : SEQ ID 492 
SequenceDescription : 

Sequence 



acgcctgcca 
tacaatgttt 
atggggaatc 
acagccctta 
aaaaaattca 



ctcaaaacgg 
caatcgcttt 
caagcggtgt 
aaagagacgc 
catggaaatt 
tctctttagg 



tttacaaggt 
taaccagcat 
tgcgggctat 
aaaaggcaaa 
agacggctct 
taatatctta 
caattatgtc 
agccgggcgc 
ttctgggcaa 
ttatggttcg 
caaaaacaat 
ggtgcaactc 
tcaaatcttt 
cacaggcttt 
gatcgttaaa 
gcaaaacctc 
ttataaagtc 
gatcatgggg 
cgctaccatt 
cgccacacaa 
ctataaattc 
taacaaaggc 
ttctgacagg 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgctaaggc tcgttagtaa aacgatttgt ttgtctttaa ttagcttgtt 
gaagcctttc aaaaacacca aaaagacgtc ttttttgtag aagctgggtt 
ctattagaag gcgcgcaaac caaagaacaa gcaatagccc aaaacaccca 
aaaatttatg aaaaccccct aacccacccc caaactaaag aacaacctaa 
aaaagcgata cagccacccc acaaagcgtc tatgggagat actacatcct 
attttagaaa aagcgactga gctattcaca gcggctaata tcaatggcaa 
ttttattctc aaaaccctgt gtatgtgatg gcatacaata aagataatgc 
ggctatggca ataacagcgt ggttgtgata caaaacttcc tgccctacaa 
attgagctga gttatacaga cgctcaaggc aaggcagtca atttaggcgt 
atccctaaag attctcaaat catcttgcct gcaagtttgt ttaataattt 
tcaccattca actctgatgg cctccaacaa ctccaaacca ctaccacccc 
gctaacacgc agagtttgtt tgaaaagctc agtcaaatca cgaccaatct 
tatgagaata cagacccctt ttctagcggc aacaacgatc ctaatggccc 
cctaaacctc attatgaatg ccctggttat aaaaagagtt gtcaagtcgc 
ttcaccccac aaaccgcaga agaattgacc aatttaatgt tagacatgat 



caacccttta 
tgaaaccggg 
aaacacccaa 
agaacaaaac 
ccaaaacacc 
cggcttaact 
cgagtttgaa 
tttaaacaat 
gatagagacc 
ttcaaacgat 
cttttctgat 
tcaaatgact 
tctcgcttct 
ttcggtgtct 
tgcggtgttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1416 
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gactctaaat cttgggaaga agccgtttta aacgcccctt tccaattttc taacagccca 960 

tcagagtgcg gcattgatta ccctaaatgc gttaatccct ttaataacgg gcttgttgat 1020 

cctaaagafcg aaaaatacgc gctaacccca gaagaggtta tcaatagtta tagagtcgcc 10 80 

aatgaactta ccgtgaacct cttgaatgcg gccaaggggt fctctagggct aggatcccaa 1140 

ctgggtagog ccaatgcccc cgatgatgat ggcttcaatc aaggtgtttt agggatagcg 1200 

ccttttgctt tagatcctga aaaattgttc ggtaaaaatt tgaataaagt ggctattttg 1260 

gcattaaga-g acattatcca tgaatatggg catactttag gctataccca taacgggaac 1320 

atgacttatc aaagggtgcg tttatgccaa gaaggaaacg ggccagaggc acgctgtgag 1380 

ggcgggcafcg aagtggagaa aaacggcaaa gaagagctag aattcagtaa tgggcatgaa 1440 

gtgcgagaoc atgatggtta cacctatgat gtttgctctc gttttggcgg caaaaatcag 1500 

cccgctttcc ctagcaatta ccccaattcc atctatacca attgcgctca agtccccgct 1560 

gggcttata-g gggttactac cgctgtttgg caacagctca tcaatcaaaa cgccctgccc 1620 

attaatttcg ctaacctaaa tagccaaacc agccatttaa acgccgggtt gaatgcgcaa 1680 

aatfcttgcaa cctctatggt cagcgcgatc gcgcaaaatt tttccaccac ttccactacc 1740 

acttaccgct cttcaagtaa gaattttaga agccctattt taggggttaa tgttaaaata 18 00 

ggctaccaa-c attatttcaa tgactacatc gggttagcct attacggcat tatccaatac i860 

aactacgctc aagctaacga tgaaaaaatc cagcaattaa gctatggtgg gggaatggat 1920 

gtgctgtttg atttcatcac cacttacacc aataaaaagc aagaccatcc aactaaaaag 198 0 

gtttttgctt cctcttttgg ggtgtttggg gggttaaggg gcttatacaa tagctactat 2040 

gtcttcaafcc aagtcaaagg aagcggtaat ttagatatag taaccgggtt taattaccgc 2100 

tacaagcatt ctaaatattc cataggcgtt agcgttcctt taatccaaag cggtattaag 2160 

atcgcttcfca ataatggtat ctatgcagac tctgttgttt tgaatgaagg gggtagccat 2220 

tttaaagtgt tttttaatta cgggtgggta ttttaa 2256 
<212> Ty£>e : DMA 
<211> Length. : 2256 

SequenceName : SEQ ID 493 

SecjuenceDescription : 

Sequence 



<213> OrcyanismName : Helicobacter pylori, strain J99 
<400> Pre^SequenceString : 

ttgcaaaact ttgtttttaa taaaaaatgg ctcatctatt ctagcctact ccccttattt 60 
tttcttaa.cc ctttaatggc agaagacgat gggtttttta tgggggtgag ttatcaaact . 120 

tctttggcog ttcaaagggt ggataactca gggcttaacg ccagtcaaga cgcatccact 180 

tatatccgoc aaaacgctat cgctctagaa tctgcggcgg tgcctttagc ctattattta 240 

gaagcgatgg gccaacaaac gagagtctta atgcaaatgc tctgccctga tccttctaaa 3 00 

agatgtttgc tctatgcagg gggctatcaa aacggacaaa ataataacgg cgatacaggc 360 

aacaaccooc caagaggcaa tgtcaatgcc acctttgata tgcaatctct agtcaataat 420 

ttaaacaacfc tcacccaact catcggcgaa actfctaatcc gtaaccctga aaatcttcct 480 

aactccaaag tctttaacgt caaatttggc aatcaaagca ctgttattgc attgcctgag 540 

ggtctagcca ataccatgga cgctttaaac aatgacatca ccaacgcttt aaccacgctc 600 

tggtataacc aaaccttaac gaataaatct tttagcaccc ctagtaacac ttctgtgaat 660 

tttagccccc aagtcttgca acacctttta caagacggct tagccacagc aaataataat 720 

caaaccattt gcagcactca aaaccaatgc accgccacta atgaagctaa atctatcgct 780 

caaaacgccc aaaacatctt ccaggcttta atgcaagcag ggattttagg gggcttagcc 840 

aatgaaaagc aatttggctt cacttacaac aaagccccca atggcagcga ttcccaacaa 900 

ggctatcaaa gctttagcgg cccgggttat tacaccaaaa acgacaacac cacgcaagcg 960 

cccttaaaag cattacccgc tggagcgaca attggatcag gcaatggcca atacacctac 1020 

caccccagct cggcagtcta ttatttagcc gatagcatca tcgctaatgg catcaccgct 108 0 

tctatgattt tttcaggcat gcaaaatttc gccaataaag ccgctaaact gataggcact 1140 

tcaagctata accagatgca agatgcgatc aactatgggg aaagcttgct tagtaacacc 1200 

gtagcgta.tg gggatttcat caccaattgg gtcgccccct atttggattt aaacaataaa 1260 

ggtttgaafct tcttgcctaa ttatgggggg caattgaatg gcgctaataa tcaaacccca 13 2 0 

caattaaccc cacaacaagc ccaacaagaa caaaaagtga tcatgaacca attagagcaa 1380 

gccacaaacg cccccacccc cgcgcaaata aacaggattt tagccaaccc ctattccccc 144 0 

acggcaaaaa ctttaatggc ttatgggctc tatcgctcta aagcagtgat tggcggagtg 1500 

attgatga-aa tgcaaactaa agtgaatcaa gtctatcaaa tgggctttgc taggaatttt 1560 

ttggagca-ta actctaattc taataacatg aacggctttg gcgtgaaaat gggctataag 1620 

caatttttcg gcaaaaagcg catgtttggg cttaggtatt atggttttta tgattttggt 1680 

tacgctca.a.t ttggcacaga atcttcttta gtgaaagcca ccctctctag ctatggagcg 174 0 

ggcacagact ttctttataa tgtttttacc cgaaaaagag ggactgaagc gatagatata 18 00 

ggtttttcfcg ccggtatcca acttgcaggg caaacctgga aaacgaattt tttagatcaa 1860 

gtggatggca accatcttaa acctaaggac acttctttcc aattcctttt tgatttgggc 1920 

ataaggacca atttttccaa aatcgctcat caaaaaagat cccgtttttc tcaagggata 1980 

gaatttggcc ttaaaatacc ggtgctttat cacacctatt accaatcaga aggcgttaca 2040 

gcgaagtata gaagagactt tagtttttat gtgggctaca acataggctt ttga 2094 
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50 



Sequence 



60 



<212> Type : DNA 
<211> Length : 2094 

SequenceName : SEQ ID 494 
5 SequenceDe script ion : 

Sequence 

<213> OrganismName : Helicobacter pylori, strain J99 

10 <4O0> PreSequenceString : 

atgaaaaaga caattctact ctctctctct ctctctctcg cttcatcgct cttgcacgct 

gaagacaacg gcttttttgt gagcgcgggc tatcaaatcg gcgaagcggt gcaaatggtc 12 0 

aaaaacaccg gtgaattgaa aaacttgaac gaaaaatacg agcaattaag ccagtattta 180 

aatcaagtgg cttcgttgaa gcaaagcatt caaaacgcca acaacattga gctggtcaat 24 0 

15 agctctttaa actatttaaa aagctttacc aacaacaact acaacagcac cacccaatcg 30 0 

cccatcttta acgccgtgca agccgttatc acttcggtat tgggtttttg gagtctttat 360 

gcggggaact atctcacttt ttttgtggtt aataaggata ctcaaaaacc cgctagtgtc 420 

cagggtaacc ctcctttttc aactattgtt caaaactgct caggaattga aaactgcgcg 480 

atgaatcaaa ccacttatga taagatgaaa aagctcgctg aagatctcca agcagcccaa 540 

20 caaaacgcta ccactaaagc gaacaatctt tgcgctttat ccggatgcgc cacaacacaa 600 

ggccaaaacc caagctcaac cgtaagcaac gctcttaact tagcgcaaca gcttatggat 660 

ttgatcgcaa acactaagac ggctatgatg tggaaaaata tcgtcatcgc tggcgtttca 72 0 

aacgtatccg gcgctatcga ttccactggc tacccaacgc aatacgcggt gtttaacaac 78 0 

attaaggcga tgatacctat cttgcaacaa gcggttacgc tttctcaaag taaccacaca 840 

25 tta-tctgcta gcttgcaagc tcaagctaca ggatctcaaa caaaccccaa attcgctaaa 900 

gacatctacg ctttcgctca aaaccaaaag caagtcattt cttacgctca agacattttc 960 

aacctcttta gttctatccc taaagatcag tatcgttatt tagagaaagc ctatttgaaa 102 0 

atacccaatg cgggtaaaac gcctactaac ccttacagac aggaggtgaa tttaaaccaa 10 80 

gaaattcaaa cgatccaaaa caatgtgagt tattatggta atcgggtgga tgcggcttta 1140 

30 agcgtggcta aagatgttta taatttaaaa tccaatcaaa cagaaatcgt aaccacctat 12 00 

aacaacgcta agaatttgag ccaagagatt tctaaactcc cctataacca agtcaataca 12 60 

aaagacatta tcacactgcc ttacgatcaa aacgctccgg cagcgggcca atacaactac 13 2 0 

cagatcaacc cagagcagca atccaatctt tctcaagctt tagcggcgat gagcaataac ,13 8 0 

ccctttaaaa aagtgggcat gatcagctct caaaacaata acggcgcttt gaacgggctt 1440 

35 gg-cgtgcaag tgggttataa acaattcttt ggcgaaagca aaagatgggg gttaaggtat 15 00 

ta_tggtttct ttgattacaa ccacggctat atcaaatcca gcttttttaa ttcttcttct 1560 

gatatatgga cttatggcgg tgggagcgat ttgttagtga attttatcaa cgatagcatc 1620 

acaagaaaga acaacaagct ttctgtgggt ctttttggtg gtatccaact agcagggact 1680 

acatggctta attctcaata catgaattta acagcgttca ataaccctta cagcgcgaaa 1740 

40 gtoaatgctt ccaatttcca atttttgttc aatctcggct tgaggacgaa tctcgctaca 1800 

gctaagaaaa aagacagcga acgttccgcg caacatggcg ttgaactggg cattaaaatc 1860 

cctaccatta acaccaatta ttattctttt ctaggcacta agctagaata cagaaggctt 1920 

tatagcgtgt atctcaatta tgtgtttgct tattaa . 1956 
<212> Type : DNA 

45 <2 11> Length : 1956 

SequenceName : SEQ ID 495 
SequenceDe script ion : 



<213> OrganismName : Helicobacter pylori, strain J99 
<4O0> PreSequenceString : 

atgttaaaac tcgccagtaa aacgatttgt ttgtccctaa tcagctcatt cacggctgta 60 

ga.agcctttc aaaaacacca aaaagacggc tttttcatag aagccggctt tgaaaccggg 12 0 

55 ctattacaag gcacacaaac ccaagaacaa accatagcca ccactcaaga aaaacccaaa 180 

cccaaaccca aaccaaaacc cattacccct caaagcacct atgggaaata ctacatctcc 24 0 

caaagcacca ttttaaagaa tgcgactgag ttgtttgcag aggataatat caccaactta 3 00 

accttttact ctcaaaaccc tgtgtatgta accgcttata accaagaaag cgctgaagaa 3 60 

gctggctatg gtaataacag cttgattatg atacaaaact tcttgcctta taacttgaac 42 0 

60 aacattgagc tgagttacac ggacgatcaa ggcaatgtgg tcagtttggg cgtgatagag 48 0 

actatcccta aacaatctca aatcattctg cccgcaagct tgtttaacga cccacagctt 54 0 
aacgccgatg gcttccaaca actccaaacc aacaccacac gattttctga tgccagcacg 
cagaatctgt ttaacaagct cagcaaggtt acaaccaatc ttcaaatgac ttatatcaat 
tacaaccaat tttctagcgg taacggcagt ggctctaaac ccccatgccc cccatacgaa 

65 aaccaagcaa attgtgtggc taaagtgccg cctttcacct ctcaagacgc taaaaatttg 

accaatttaa tgctgaacat gatggcggtg tttgattcta aatcttggga agacgccgtc 840 

tfcaaacgctc ctttccaatt cagcgacaac aacctgtcag cgccatgtta ttctgattac 900 



600 
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720 
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cttacatgcg tgaatcctta caacgatggg cttgttgatc ctaaattgat cgccaaaaat 960 

aaaggagatg aatacaatat agaaaacggg caaacaggct cagtgatatt aacgccgcaa 1020 

gatgttatct atagctatag agtcgctaat aatatttatg tgaatctctt gcccacaaga 1080 

ggaggggatt tagggttagg gtctcaatat ggtggcccga atggcccagg cgatgatggc 1140 

5 accaattttg gcgctttagg gatattgtcc cctttcttag accctgaaat attgtttggc 1200 

aaagaattga ataaagtcgc catcatgcaa ttaagagaca tcatccatga atacggccat 1260 

acttfcaggct atacgcataa cgggaacatg acttatcaaa gagtgcgcat gtgcgaagaa 1320 

aacaa-tgggc cagaagagcg ctgtcagggc ggaaggatag agcaagtgga tgggaaagaa 1380 

gtgcaagtgt ttgacaacgg gcatgaagtg cgagacaccg atggctctac ctatgatgtg 1440 

10 tgttctcgtt ttaaagataa gccctataca gcgggcagct atcctaattc catctatacc 1500 

gattgctctc aagtccccgc tgggcttata ggcgttacca gcgctgtttg gcaacaactc 1560 

attgatcaaa acgccctacc ggtggatttt actaatttga gcagccaaac caactatttg 162 0 

aacgccagct tgaacacgca agactttgcg accaccatgc ttagcgcgat cagtcaaagc 168 0 

ctttcatctt, ctaaatctag cgccactact tatcgcactt eaaaaacctc acggcccfctt 1740 

15 ggagcccccc tattaggcgt taatcttaaa atgggctatc aaaaatattt taatgattat 18 0 0 

ctagggttgt cttcttatgg cattatcaaa tacaactacg ctcaagccaa caacgaaaaa 1860 

atccagcaat taagctatgg cgtgggaatg gatgtgctgt ttgatttcat caccaattac 192 0 

actaacgaaa agaaccccaa aagcaatcta accaagaaag ttttcacttc ctctcttggg 1980 

gtgtttgggg ggttaagggg cttatacaac agctattatt tgttgaacca atacaaaggg 2 04 0 

20 agcggjtaatt taaatgtgac cggtgggttg aattaccgct acaagcattc caaatattct 2100 

atagg-catta gcgttccttt ggtccagttg aaatctagga tcgtttctag cgatggtgct 2160 

tataccaatt ctatcaccct caatgaaggg ggcagtcatt ttaaagtgtt ttttaattac 222 0 

gggtggattt tctaa 2235 
<212> Type : DNA 

25 <211> Length : 2235 

SequenceName : SEQ ID 496 
SequenceDescription : 

Sequence 
30 

<213> Organi smNarae : Helicobacter pylori, strain J99 

<400> PreSequenceString : 

atgagaaaac tattcatccc acttttatta 

ggctttttca tagaagccgg ctttgaaact 
35 aaaagacaca ccaccacaaa aaacacttac 

attttaaaaa gagcggctaa tttattcacc 

tcatctttat cccctgttag agtgttgtat 

ttcttgcctt ataatttaaa taatgttaag 

atcgatctag gcgtgataga gactatcccc 
40 gcatttgata gtctaaaaat tgacccctat 

agcacttcta tttctgacgc taacacgcag 

acaaatttgg tcgtaaatta taggaatgaa 

gaagccttta ccccacaaac cgcagaagaa 

gttttagact cccaatcttg gggcgatgcg 
45 agcccaacag attgcgataa tgatccttca 

gtcaattcta aagtcgatca aaaatatgtg 

aaaaacaaag cggatcttga tgtaattgtt 

agtgatatta cccctagcaa caatgatgat 

gcttctgctt tagatcctaa aaaactcttt 
50 gatttaagaa ccatcttgca tgaattcagc 

atgacctatc aaagagtgcc ggtaacgaaa 

aagccaaaag attctgatgg cctcccctat 

cagcccgctt tccctagcaa ctaccctaat 

gctggctttt taggggtaac agcagcggtt 
55 ccgatcaact acgctaactt ggggagtcaa 

acgcaagatt tagccaattc catgctcagc 

gttaccaacc accatttttc aaacgcatcg 

aacgctaaaa taggctatca aaactacttt 

atcatcaaat acaattacgc taaagctgtt 
60 ggggggatag atttgttatt ggatttcatc 

ggcattcaaa ccaaaaggaa tttttcttca 

ttgtataaca gctattatgt gttgaacaaa 

accgcjgttga actaccgcta taagcattct 

atccaaagaa aagctagcgt cgtttctagc 
65 aatgaagggg ctagccactt taaggtgttt 

<212 > Type : DNA 



ttcagcgctt tagaagcgaa 
gggctattag aaggcacaca 
gcaacttaca attatttacc 
aatgccgaag cgatttcaaa 
atgtataatg gtcaattaac 
cttagtttta cagacgctca 
aaacactcta agattgtttt 
actttatttc ttccaaaaat 
agggtgtttg aaacgctcaa 
aacaaattta aagatcacga 
ttcactaatt taatgttgaa 
atcttaaacg ctccttttga 
aaatgcgtaa atcctgggac 
ttaaacaaac aagacattgt 
ttaaaggatt caggggttgt 
ggcaagcatt atggccagtt 
ggcgataacc ttaagactat 
cacactaaag gctatgggca 
gatggtcaag tggaaaagga 
aatgtgtgtt cgctttatgg 
tccatctatc acaattgtgc 
tggcagcagc tcatcaatca 
acaaactaca acctaaacgc 
accatccaaa aaacctttgt 
caaagtttta gaagccctat 
aatgatttca tagggttggc 
aatcaaaaag tccagcaatt 
accacttact ccaataaaaa 
tcttttggta tctttggggg 
gtcaaaggaa gcggcaattt 
aaatattctg tagggattag 
ggtggcgatt atacgaactc 
ttcaattacg ggtgggtgtt 



cgagaaaaac 60 

aacgcaagaa 120 

cacagacacg 180 

attaaaattc 240 

tatagaaaac 30 0 

aggcaatgtg 360 

gcccggagag 420 

t gaagc cac t 480 

taagattaag 540 

aaatcattgg 600 

catgatcgct 660 

gttcactaac 72 0 

aaacgggctt 780 

caataaattt 840 

agggcttggg 900 

aggggtagta 960 

caatttagag 102 0 

taacgggaat 1080 

tagtaatggc 1140 

gggatccaat 1200 

ggatgtcccg 1260 

aaacgccttg 132 0 

tagtttaaac 1380 

aacttctagc 1440 

tttaggggtt 150 0 

ttattatggc 1560 

gagctatggt 1620 

tagccctaca 1680 

gttaaggggc 174 0 

agatgtggct 18 0 0 

catcccttta 1860 

ttttgttttc 1920 

ttag 1974 
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<211> Length : 1974 

SequenceName : SEQ ID 497 
SequenceDescrription : 



5 Sequence 

<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceSfcring : 

atgagtctag ctacgagfcta caatgtgagt aataattttt ctaagtttaa tattaagaga 60 
10 gtcagaggat atttgattitg tcttgtttgt aacacaccta aaatgataca aagaggattg 120 
aatggtgtct cattttatgg ttgctctgat tatgtaaata aaggcgactg taagggcgtt 180 
ttacgagaaa taaatggctc aatgaaaatg gtctgcttac attgtgaaaa cacgcccata 240 
atggaaaaag tagaaagfcgg taggggagga gcttacgctt gtaagaattg caataggaag 3 00 

ttttacttta- tcgatcttgc aaaacaaaac gaacgaaaaa aagatttaga^aaaagaaaaa 3 60 

15 aaagaattgc ttaataagat tgaaaagcaa aaaatcaaac accttgagcg tttcattttg 420 
gctggtgtaa aagctaatat taaagaaaat tcttttttct taggatgtaa aaattatcct 480 
aagtgcgaat ggactgcfcag tatggattca caagatctta aatgtcccaa atgcaacaga 540 
ttaatgaaaa gaaaaaagaa tttcaaaaac aatgagtttt ttacagctac atcgcttacc 600 
ttaaatgcaa tagaattttg tctctatatt aatttgaaaa aaaaggaaac caatgtttag 660 

20 

<212> Type : DNA 
<211> Length : 660 

SequenceName : SEQ ID 498 

SequenceDescription : 

25 

Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenc est ring : 

30 ttggaaatta agaaatattt tctttacgct ctattttttt tgcttttttc tggtcttttt 60 

ttatccaaac ttcaagctta taaattcaac atgagtattg ttggaaaggt gagcagctat 120 

actaagtttg gctttaacraa ccaaagatac cagccttcta aagacattta tcctacaggt 180 

agttatactt ctttactcgg cgaattgaat ttgagcatgg gattatacaa gggcttgagg 240 

gcagaagtag gggctatgjat ggcagcgctt ccctatgact ctaccgccta tcaaggcaat 3Q0 

35 aatatcccta atggccagcc cggatctagg acggatcctt ttggggcggg tatcttttgg 360 

caatacattg gctggtatgc aggacatagc ggtttaaacg tgcaaaaacc tcgtttggct 420 

atggtgcata acgctttfctt gagctacaac tacaagaaag acaaattcag ttttggcgtg 480 

aaaggggggc gctatgatgc tgaagagtat gattggttca cttcttacac tcaaggggtt 540 

gaaggctttg tcaaatacaa agacaccagg ttaagggtga tgtattcaga cgctagggct 600 

40 tcagcgtcaa gcgactggtt ttggtatttt gggcgttact atacaagcgg taaggctcta 660 

atgattgcgg atttgaaata cgaaaaagac aacctaaaaa tcaaccctta tttttatgcg 720 

atctttcaaa gaatgtatgc gccaggcatt aatatcactt acgacaccaa ccctaatttc 780 

aacaataagg gctttcgttt tgtaggcact ttcgtggggt ttttccccat ttttgccact 840 

ccggctaatc aaaatgatrat tatcctattc caacaagtgc cattaggaaa gagcgggcaa 900 

45 acttatttct tccgcactcg tttttactat aacaagtggc aatttggggg tagcgtctat 960 

aaaaatatcg gtaacgctaa tggcgatata ggtatttatg gcgaccctct agggtataac 1020 

atttggacga atagtattta tgacgcagaa attaacaata tcgttggcgc tgatgttatt 1080 

aacgggtttt tatatgfca.gg ctcgcagtat agggggttta gttggaaaat tttaggccgt 1140 

tggacggata gcccaagggc tgatgaaagg agtctcgcgc tctttttgag ttatttttct 120 0 

50 aataagtata atattagaat ggatttgaaa ctagaatatt atggcaatat caccaaaaaa 1260 

ggctattgta ttgggtafctg tggcatgtat gttccagttg atcctaatgg gcctggcacg 1320 

caacctttaa cacacaacgt gtattctgac aggagccata tcatgtttaa cattacttat 1380 

ggttttagga tttactag 13 98 
<212> Type : DNA 

55 <211> Length : 1398 

SequenceName : SEQ ID 499 
SequenceDescaription : 



Sequence 
60 

<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceS "bring : 

atgaaaaaga caattctact ctctctctct ctctctctcg cttcatcgct cttgcatgct 60 
gaagacaacg gcttttttgt gagcgcgggc tatcaaatcg gcgaagcggt gcaaatggtc 120 
65 aaaaacaccg gcgaattgaa aaacttgaac gacaaatacg agcagttaag ccaatcttta 180 
gcccaactgg cttcgtta.aa aaaaagcatt caaacggcga acaacattca ggctgtcaac 240 
aatgctttaa gcgatttaaa aagctttgcg agtaacaacc acacaaacaa agaaacatcg 300 
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10 



15 



20 



25 



cccatctaca 
gcagggaacg 
ggaagaat cc 
acttatgata 
gccttatcag 
cttcaaaccg 
aaaaatatcg 
actggtcatg 
caacaagcgc 
gctatgggat 
caaaagcaaa 
gaccaactta 
actaaccctt 
gtagctaatt 
ctaaaatcca 
gagatttcta 
gattctacag 
ttagcggcga 
aacggcgctt 
aaaagatggg 
agctttttta 
aattttatca 
ggtatccaac 
aataaccctt 
ttgaggacga 
gttgaactgg 
aagctagaat 



acaccgcgca 
ctctcagttt 
atagagatgg 
aaatgaagac 
aatgctctag 
cgcaacagct 
tcatcgcagg 
taaccgacta 
ttacgctttc 
ctcaaacaaa 
tcct.tt.ctaa 
agtatttgga 
acagacagaa 
atggtaatcg 
atcaaacaga 
aacttcccta 
cgggccaata 
tgagcaataa 
tgaacgggct 
ggttaaggta 
attcttcttc 
acgatagcat 
tagcagggac 
acagcgcgaa 
atctcgctac 
gcattaaaat 
accgaaggct 



agctgttatc 
tcatgtgacc 
gaactgcaca 
acttgccgaa 
caatcaatca 
catggactta 
tgttacaaac 
tgcggtgttt 
tcaaagtaac 
tcgtgaattc 
cgcttcaagt 
gaacgcttac 
tgtgaatttg 
tttggattcg 
gatcgtaacc 
taaccaagtc 
ccaaatcaac 
cccctttaaa 

tggcgtgcaa 

ttatggtttc 
tgatatatgg 
cacaagaaag 
tacatggctt 
agtcaatgct 
agctaagaaa 
ccctaccatt 
ttatagcgtg 



acttcagtat 
ggtttgaatg 
ggattacaac 
aacctccaaa 
aatggaggca 
atcgaacaga 
aaacccaatg 
aacaacatca 
cacaccctat 
gctaaagaca 
atcttcaatc 
ttgaaagtgc 
aataaagaaa 
.gctttaagcg 
acttataacg 
aatgtaacaa 
ccagagcagc 
aaagtgggca 
gtgggttata 
tttgattaca 
acttatggcg 
aacaacaagc 
aattctcaat 
tccaatttcc 
aaagacagcg 
aacaccaatt 
tatctcaatt 



tggctttttg 
atggatctaa 
aatgttttat 
aagctcaagg 
aaacttccat 
ccaaggtttc 
gtgctggcgc 
aggcgatgct 
ccactcagtt 
tctacgcttt 
tctttaattc 
cacatttggg 
ttaatgcggt 
tggctaaaga 
atgctaagaa 
acatcgttat 
aatccaatct 
tgatcagctc 
aacaattctt 
accacggcta 
gtgggagcga 
tttctgtggg 
acatgaattt 
aatttttgtt 
aacgttccgc 
attattcttt 
atgtgtttgc 



gagtctttat 
ttctccttta 
gagcaaagaa 
caatctctgt 
gactacagct 
tatggtgtgg 
tatcacatcc 
acctatcttg 
gcaagctcga 
agctcaaaac 
cattcctaaa 
taaaacccct 
tcaagacaat 
, tgtttataac 
tttgagcgaa 
gtcgcctaaa 
taaccaagct 
tcaaaacaat 
tggcgaaagc 
tatcaaatcc 
tttgttagtg 
tctttttggt 
aacagcgttc 
caatctcggc 
gcaacatggc 
tctaggcact 
ttattaa 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1917 



<212> Type : DNA 
30 <211> Length : 1917 

SequericeName : SEQ ID 500 
SequenceDescription : 



35 



40 



45 



50 



55 



60 



65 



Sequence 

<213> Organ! smName : Helicobacter pylori, strain J99 
<400> PreSecnienceString : 

atgaaattaa aaaaacgaaa agttgcggct acattgctaa agcgtttgac 
ttgttcacta cgggttcatt aggggcggtt acttatgaag tgcatgggga 
ttctccaaag tgggttttaa ccgttcgcct attaaccctg ttaaaggtat 
gaaacttttg ttaaccttac gggtaagcta gaggggtctg tgcatttagg 
accgtgaatg taggcggtgt tttgggcgga caagtttatg ataacactag 
tgggcaaagg attttacccc cccaagctat tgggataaaa cttcttgcgg 
ttgagccttt cjtatgaatgc gactaaaatg tggcaacagc aagggccagg 
gaccctaggg gtattggcta tatgtatatg ggtgagtgga acggcttgtt 
tatccggcta acgcctactt gcccgggcat tcaaggcgct atgaagttta 
cttacctatg acagcgacag agtccatatg gtaatggggc gctttgatgt 
gagcaaatgg attggattta ccaattgttc caaggttttt atgggacttt 
aagaacatga aattcttgct ctttagctct tggggtcgtg gtatcgctga 
ttgttcccta tctatcgtga aaagccttgg ggtattcata aggcgggtat 
cctacaaaga atctaatgat ccacccttat gtgtatctca tcccaatggt 
cccggtgcta aaatagaata cgataccaat cctgagttta gcggtagagg 
aaaacgactt tctatgtgtt gtatgactat cgttggaata acgctgaata 
gcacccgctc gttataacac ttgggatccg ttcttggata atggtaagtg 
caaggtcctg gtggtgcgac gctctatttg caccaccaca tagacattaa 
gtggttggtg gtgcttacct caacatcggt aaccctaaca tgaacttagg 
aaccctgtgg ctcttgatgg tatcgaacaa tgggtcggtg gcatctacag 
gcggggattg acaacattac cgatgctgat gcgttcactg agtatgttaa 
aagcatggta agttcagttg gagcgtttat cagcgcttca ctaccgcacc 
gaatatggta ttggtatgta tctagactat cagttcagca agcatgttaa 
aaactcgtgt ggttagagtt ccaaatccgt gcgggttaca accctggaac 
gggccaaacg gtcagccgct caacttgaat aatggtttgt ttgaatcttc 
caaggccctc aaaacatggg tggtatcgca aaaagcatta ctcaagacag 
atgacacaca tcagttatag tttctaa 
<212> Type : DNA 
<211> Length : 1587 

SequenceName : SEQ ID 501 



cttgccacta 
ttttatcaac 
ctatcctaca 
taggggatgg 
gtatgatagg 
cactgattct 
tggtatcatt 
ccctaattac 
taaagcgaat 
taccgagcag 
caagcttact 
tggtcaatgg 
tatttatcgc 
aggtacattg 
tataaggaat 
cggccgttac 
gcgtggcttg 
caactacttt 
tacttggggt 
cttaggcttt 
aggtggaggt 
aagggctttg 
agcgggtctc 
cggtttcctt 
ggcgttcgcg 
aagccatttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1587 
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SequenceDescription : 



Sequence 



5 <213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaaaaact tttccccact ctattgfcctt aaaaagctca aaaaacgcca tttaatcgct 
ctgagtctgc ccttgctttc ttatgcgaat ggctttaaaa tccaagagca aagcttgaat 
ggcacggctt taggctcggc gtatgtcgct ggggctaggg gtgctgacgc ttctttttac 

10 aacccggcta acatgggctt tactaacgat tggggcgaaa acagaagcga atttgaaatg 
accaccaccg tgatcaatat cccggccttt agctttaaag tccctacgac caatcaaggc 
ttatattcgg taacaagttt agaaattgat aaaagccaac aaaatatttt aggcatcatc 
aacactatag ggttaggcaa tatccttaaa gcgcttggca atacggccgc taccaatggc 
ttatcacaag cfcatcaatcg tgttcaaggg cttatgaact taaccaatca aaaagtcgta- 

15 accctcgctt caaaacctga cactcaaatc gtgaatggct ggacaggcac gactaatttt 
gttttaccta aattctttta taaaacgcgc acgcataacg gcttcacttt tggggggagt 
tttaccgctc ctagtgggtt gggtatgjaaa tggaatggta agggggggga atttttgcat 
gacgtgttta tcatgatggt agagcttgcc cctagcatga gttatactat taataaacgc 
ttttctgtgg gtgtgggttt aagggggctt tatgcgaccg ggagctttaa taacaccgtt 

20 tatgtgcctt tagagggcgc ttcagtfcttg agcgcggagc aaatcttaaa cttacccaac 
aatgtttttg ccgatcaagt gccaagtaac atgatgactt tattaggcaa tattggctac 
caaccagcgc ttaattgcca aaaagccggt ggggacatga gtgatcagag ctgtcaagag 
ttttacaacg gcttgaaaaa aatcatgggt tatagcggtt taatcaaagc gagcgcgaat 
ctttatggca cgactcaagt cgtgcaaaaa tctaacggac aaggcgtatc gggggggtat 

25 agagtgggtt cgagtttgcg tgtgttfcgat catggcatgt tttctgtggt gtataattct 
tcagttacct ttaacatgaa aggcggtttg gtggctatca cagagcttgg cccttcttta 
gggagcgttt tgactaaagg cagcttgaat atcaatgttt cactccccca aactttaagc 
ttagcctacg cccaccaatt ttttaasgat cgcctaaggg ttgaaggggt gtttgagcgc 
actttttgga gtcaagggaa taaattfctta gtcacccctg attttgcgaa cgccacttac 

30 aagggcttga gcgggacggt ggcttccttg gactctgaaa cgcttaaaaa aatggtaggc 
ctagcgaatt ttaaaagcgt gatgaacatg ggggctggct ggagggacac caacaccttt 
agattagggg taacttacat gggtaaaagc ttgcgtttaa tgggcgctat tgattatgat 
caagccccaa gcccccaaga cgcgataggc attccggact ctaatggcta taccgtggct 
tttgggacta aatacaattt taggggcrttt gatttgggcg tagcggggag tttcactttt 

35 aagagcaacc gctccagttt gtatcaatcc ccaactattg ggcaattgag aatctttagc 
gcctctttag gctatcgctg gtaa 
<212> Type : DNA 
<211> Length : 1764 

SequenceName : SEQ XD 502 

40 SequenceDescription : 



60 
120 
180 
240 
300 
360 
420 
. 480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1764 



Sequence 



<213> OrganismName : Helicobactei 

45 <400> PreSequenceString : 

atggcttttc aggtcaatac aaatatcaat 
actcaaaacg cacttaaaac ttcattggag 
gcggctgatg acgcatcagg catgacggtg 
ttgggtcaag cgattgccaa cacgaatgac 

50 gctatggatg agcaattaaa aatcttagac 
caagatgggc aaactacgga atctcgtaaa 
caaggtttgg ataatatcgg taacacaacg 
caattcacta acaaagaatt ccaagtaggg 
atcggctcta ccacttccga taaaatcggt 

55 acggcttctg gggatattag cttgactttt 
ttagagagcg taaaagtttc tagttda.gca 
atcaataaaa actctaaccg aacaggggtt 
gatgtggcgg tccagtcagg aagtttcjagt 
aatatcgcag atattaagaa aaacgactca 

60 gtcacttcag aaaccggtgt ggaagcttat 
agtatagatg gtcgtgggat tgaaatcaaa 
ttaacgatgg tcaatggcgg tcaggattta 
tctctcacac gattagacgc taagagcatc 
ttaggcttca cggcgattgg ttttggggaa 

65 cgcgatgtta ctgggaattt taacgctaat 
gccgtgatcg ctagcggtaa ccaaagcttg 
atggtggtga ttgacattgc cgaatcagcg 



• pylori, strain J99 

gcgatgaatg cgcatgtgca atccgcactc 60 

cgattgagtt caggtttaag gatcaataaa 120 

gcggattctt tgcgttcgca agcgagcagt 180 

ggcatgggga ttatccaggt tgcggataag 240 

accgttaagg ttaaagcgac tcaagcggct 300 

gcgattcaat ctgacatcgt tcgtttgatt 360 

acttataacg ggcaagcgtt attgtctggt 42 0 

gcttattcta accaaagcat taaggcttct 48 0 

caggttcgta tcgctacagg cgcgttaatc 540 

aaacaagtgg atggcgtgaa tgatgtaact 60 0 

ggcacaggga tcggcgtgtt agcagaagtg 660 

aaagcttatg cgagcgttat caccacgagc 72 0 

aatttaacct taaatgggat tcatttgggt 780 

gacggaaggt tagtcgcagc gatcaatgcg 840 

acggatcaaa aagggcgctt gaatttgcgc 900 

accgacagcg tcagtaacgg gcctagcgct 960 

acaaaaggct ctactaacta cggaaggctt 102 0 

aatgtcgttt cggcttctga ctcacagcat 1080 

tctcaagtgg cagaaaccac ggtgaatttg 1140 

gtcaaatcag ccagtggcgc gaactataac 12 00 

ggatctgggg ttacaacctt aagaggcgcg 1260 

atgaaaatgt tggataaagt ccgctctgac 1320 
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ttaggttctg tgcaaaatca aatgattagc 
aatgttaaag cggctgaatc tcaaatcagg 
ttcaacaaaa acaacatttt ggcgcaatca 
gttcaacaaa atatcttaag gcttttaact 
5 <212> Type : DNA 

<211> Length : 1533 

SequenceName : SEQ ID 503 
SequenceDescription : 

10 Sequence 



<213> OrganismName : Helicobacter 
<40 0> PreSequenceString : 
- atggcaggca caeaagctat atiatgaatca 

15 tcaatcatct caagcacaag tcjgtgtcgca 
atgacagcag cgattattcc tattgttgtg 
atgacccaat acaatcaaag catcgctgaa 
caacaataca accaattgta tcaaggtttt 
atcttaaata tcagcaaatt aacaggggaa 

20 caaattagtg ctgtcaatag tcagattgca 
aatcctagcg ctattgaagc ttatgcgacg 
acggttgaaa tgatgagcgg tatattaggc 
ctagctctac aagagcaact gcgttctcaa 
gattcccttg atagctgtac cgctttaggc 

25 agttgcatgc aaatttctat gactcctatg 
taccaagcgg ttgccactaa agccctaact 
tgccctattg gggacaaggt tcttgccgtt 
ttgagagaat actatataga afcttgtgaaa 
caaatgatac ttaatcaatc acjgattagct 

30 aacataagct cgctatataa ttacaatata 
ttgacttatc ttgattacat caaagacaag 
gaaagggtgc agactaaaat aatcgtgaag 
<212> Type : DNA 
<211> Length. : 1X13 

35 SequenceName : SEQ ID 504 

SequenceDescription : 

Sequence 



40 <2X3> OrganismName : HeXicobacter pylori, strain J99 
<400> PreSequenceString : 

atgactaacg aagccattaa ccaacaacca caaaccgaag cggcttttaa cccgcagcaa 60 

tttatcaata atcttcaagt ggcttttatt aaagttgata atgttgtcgc ttcatttgat X20 

cctaatcaaa aaccaatcgt tgataagaat gatagggata ataggcaagc ttttgagaaa 180 

45 atctcgcagc taagggagga attcgctaat aaagcgatca aaaatcctac caaaaagaat 240 

cagtattttt caagctttat cagtaagagc aatgatttaa tcgacaaaga caatctcatt 3 00 

gatacaggtt cttccataaa gagctttcag aaatttggga ctcagcgtta ccaaattttt 360 

atgaattggg tgtcccatca aaacgatccg tctaaaatca acacccaaaa aatccgaggt 420 

tttatggaaa atatcataca accccctatc tctgatgata aagagaaagc ggagtttttg 480 

50 aggtctgcca aacaagcttt tgcaggaatt atcataggaa accaaatccg atcggatcaa 540 
aaattcatgg gcgtgtttga tcjaatctttg aaagagaggc aagaagcaga aaaaaatgga 60 0 

gagcctaatg gagatcctac tggtggggat tggcttgata tttttttatc atttgtgttt 660 
aacaaaaaac aatcttccga tctcaaagaa acgctcaatc aagaaccagt tcctcatgtc 720 
caaccagatg tagccactac caccactgac atacaaagct taccgcctga agctagggat 780 

55 ttgcttgatg aaaggggtaa tttttctaaa ttcactcttg gcgatatgaa catgttagat 840 
gttgagggag tcgctgacat tgatcctaat tacaagttca accaattatt gatccacaat 900 
aacgctctgt cttctgtgtt aatggggagt cataatggca tagaacctga aaaagtttca 960 

ttgttgtatg gaaacaatgg tggtcctgaa gctaggcatg attggaacgc caccgttggt 102 0 

tataaaaacc aacgaggcga caatgtggct acactcatta atgtgcatat gaaaaatggc 1080 

60 agtgggttag tcatagcagg tggtgagaaa gggattaaca accctagttt ttatctctac 1140 

aaagaagacc aactcacagg ctcacaacga gcattgagtc aagaagagat ccaaaacaaa 12 00 

gtggatttca tggaatttct tgcacaaaat aatgctaaat tagacaactt gagcaagaaa 1260 

gagaaagaaa aattccaaaa tgagattgaa gattttcaaa aagactctaa ggcttattta 1320 

gacgccctag ggaatgatca cattgctttt gtttctaaaa aagacaaaaa acatt'fcagct 1380 

65 ttagttgctg agtttggtaa tggggaattg agctacactc tcaaagatta tgggaa^aaaa 144 0 

gcagataaag ctttagatag gcjaggcaaaa accactcttc aaggtagcct aaaacatgat 1500 

ggcgtgatgt ttgttgatta ttctaatttc aaatacacca acgcctccaa gagtcctgat 1560 



actgtgaata acatcagcat cactcaagtg 1380 

gatgtggatt tcgctgaaga gagcgcgaat 1440 

ggcagctatg cgatgagtca agccaatacc 1500 

tag ^ 1533 



r pylori, strain 0*99 



tcttctgcag gattattatc gcaagtctcc , 60 

gggccatttg caggaatagt agcgggcgct 12 0 

ggatttacta atccgcaaat gaccgctatc 180 

gctgtaagcg tgcctatgaa agccgctaac 240 

aacgatcaaa gcatggctgt ggggaacaat 3 00 

tttaacgcgc aaggcaacac gcaaagcgcg 360 

agcattttag cgagtaacac tacccctaaa 420 

aatcaaatcg ctgttcctag cgtgccaaca 480 

aatattacaa gcgcagcacc aaaatacgcc 540 

gcaagcaaca gctcaatgaa tgatacagcc 600 

gcacttgttg gctcatcaaa agtgtttttc 660 

agtgtttcta tgcccactgt ttatgccaaa 720 

tcaggcgtta atcctatgac cactcctgca 780 

tattgctatg ctgaaaaagt agcagaaatt 84 0 

aacaatacca atttgttgca gaacgcttct 900 

actagcacct atgacactca agcgatttct 960 

gtagcgaata aatctttttt gaaatcgcat 1020 

cttaaggggc aaaaagatag ctacttaaca 10 80 

tga 1113 
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aagggtgtgg gtgctacgaa tggcgtttcc cattfcagaag caggcttfeag caaggtagct 1620 

gtctttaatt tgcctaattt aaataatctc gctatcacta gtgtcgtaag gcaggattta 168 0 

gaggataaac taatcgctaa aggattgtcc ccacaagaag ctaataagct tgtcaaagat 1740 

tttttgagca gcaacaaaga attggttgga aaagctttaa acttcaataa agctgtagct 1800 

gaagctaaaa acacaggcaa ctatgacgag gtgaaacaag ctcagaaaga tcttgaaaaa 18 60 

tctctaaaga aacgagagcg tttggagaaa gatgfcagcga aaaatttgga gagcaaaagc 1920 

ggcaacaaaa ataaaatgga agcaaaatct caagctaaca gccaaaaaga tgagattttt 1980 

gcgttgatca ataaagaggc taatagggat gcaagagcaa tcgcttacgc tcagaatctt 2040 

aaaggcatca aaagggaatt gtctgataaa cttgaaaata tcaacaagga tttgaaagac 2100 

tttagtaaat cttttgatga attcaaaaat ggcaaaaata aggatttcag caaggcagaa 2160 

gaaacactaa aagcccttaa aggctcggtg aaagatttag gtatcaatcc agaatggatt 2220 

tcaaaagttg aaaaccttaa tgcagctttg aatgaattca aaaatggcaa aaataaggat 22 8 0 

ttcagcaagg taacgcaagc aaaaagcgac cttgaaaatt ccattaaaga tgtgatcatc 23 40 

aatcaaaaga taacggataa agttgataat ctcaatcaag cggtatcagt ,ggctaaagca 2400 

acgggtgatt tcagtggggt agagcaagcg ttagccgatc tcaaaaattt ctcaaaggag 2460 

caattggctc aacaagctca aaaaaatgaa gatttcaata ctggaaaaaa ttctgcacta 2520 

taccaatccg ttaagaatgg tgtaaacgga accctagtcg gtaatgggtt atctaaagca 2580 

gaagccacaa ctctttctaa aaacttttcg gacatcaaga aagagttgaa tgcaaaactt 2640 

ggaaatttca ataacaataa caataatgga ctcgaaaaca gcacagaacc catttatact 2700 

caagttgcta aaaaggtaaa agcaaaaatt gaccgactcg atcaaatagc aagtggtttg 27 60 

ggtgatgtag ggcaagcagc gagcttcctt ttgaaaaggc atgataaagt tgatgatctc 2820 

agtaaggtag ggctttcagc taaccatgaa cccatttacg ctacgattga tgatctcggc 28 80 

ggacctttcc ctttgaaaag gcatgataaa gttgatgatc tcagtaaggt agggctttca 2940 

agggagcaaa aattgactca gaaaattgac aatctcaacc aggcggtatc agaagctaaa 30 00 

gcaagtcatt ttgacaacct agatcaaatg atagacaagc tcaaagattc tacaaaaaag 3 060 

aatgttgtga atctatatgt tgaaagtgca aaaaaagtgc ctactagttt gtcagcgaaa 3120 

ttggacaatt acgctactaa cagccacaca cgcattaata gcaatgtcaa aaatggaaca 3180 

atcaatgaaa aagcgaccgg catgctaacg caaaaaaatt ctgagtggct caagctcgtg 3240 

aatgataaga tagttgcgca taatgtggga agtgctcctt tgtcagcgta tgataaaatt 33 00 

ggattcaacc aaaagaatat gaaagattat tctgattcgt tcaagttttc caccaggttg 33 60 

agcaatgccg taaaagacat taagtctggc tttgtgcaat ttttaaccaa tatattttct 3420 

atgggatctt acagcttgat gaaagcaagt gtggaacatg gagtcaaaaa tactaataca 3480 

aaaggtggtt tccaaaaatc ttaa 3504 
<212> Type : DNA 
<211> Length t 3 504 

SequenceNarae = SEQ ID 505 

SequenceDescription r 

Sequence 



<213> OrganismName ; Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaaaacaa atggtcattt taaggatttt gcatggaaaa aatgcttttt aggcgcgagc 60 

gtggtggctt tattagtggg gtgtagcccg catattattg aaaccaatga agttgctttg 12 0 

aaattgaatt accatccagc tagcgagaaa gttcaagcgt tagatgaaaa gattttactt 18 0 

ttaaggccag ctttccaata cagcgataat attg-crtaaag agtatgaaaa caaattcaag 240 

aatcaaacca cgcttaaagt tgaagagatc ttgcaaaatc agggctataa ggttattaat 300 

gtggatagca gcgataaaga cgatttttct tttgogcaaa aaaaagaagg gtatttggct 360 

gtcgctatga atggcgaaat tgttttacgc cccgatccta aaaggaccat acagaaaaaa 42 0 

tcagaacccg ggttattatt ctccactggt ttggataaaa tggaaggggt tttaatcccg 480 

gctgggtttg tcaaggttac catactagag cctatgagtg gggaatcttt ggattctttt 540 

acgatggatt tgagcgagtt ggacatccaa gaaaaattct taaaaaccac ccattcaagc 60 0 

catagcggag ggttagttag cactatggtt aaggggacgg ataattctaa tgacgcaatt 660 

aagagcgctt tgaataagat ttttgcaagt atcatgcaag aaatggataa gaaactcact 720 

caaaggaatt tagaatctta tcaaaaagac gccaaggaat taaaaaacaa gagaaaccga 780 

taa 7 33 
<212> Type : DNA 
<211> Length : 783 

SequenceName : SEQ ID 506 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgaaatcga agctaaagtt aaaacgttat ttactgtttt taccactttt accgctaggg 60 
acgttgtcac tagccaacac ctacctcctc caagaccaca acaccctcac cccctacacg 120 
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ccctttacga caccgctcaa tggggggctg 
tcatacgaac tcgtggactg aaagcgggtg 
tcagcgttgg tcagggtgaa attccaggac 
caaaatgcct tgagttttga tacccaagaa 
5 ggatcttctg acacttccgg gtctaactcc 
aaagccgcgc ccagggccac gtgggtgttt 
gttaagcagg aaagtcaggg ttccggcgat 
aaaaccctcc aagacctcct cgtcgaacaa 
ttagcccggg tgaatggggt tgctcaggat 

10 agttggaatt cccaacgttc ccaaaaaggc 
accggcttta agctcgataa gggccgcgcg 
tatgaacccc tggattcgac caaggagggg 
tcggaaaaaa caacagcgga aaatgatgcc 
gctggaagtg cttctagttt acaacgcaa,t 

15 ttgagatcag cacctgtcag tgttccacca 
tctaaccccg ctcctgtggg cccacaagcg 
gcagcagtgt ccgtcaatcg cacagcgagt 
accgcccagg ccttgcacca gatgggggtg 
aacaacggta cgggtgtagt ggctagccga 

20 gcggcaggtg cttcccaaac gggtttggga 
gcaacgtcac agcgtgccgt cacggtggtt 
gaaactgatg ccctaccgaa tgtcatcacc 
gcttacttaa atggccagat cgttgtgatg 
tgagttgtcg gggaggacca ggaatcgggc 

25 aactggggca ccgacaagca gaagcagttt 
tcaaattcgg attccaaaaa ttcgaatttg 
ctcatcgccg gtcttgacgt tgtggccgac 
gcggtggggt atgatatgac gactgattcg 
tggtcgacca cggccgggtt ggacagtgat 

30 gccgggctca acggcccgat taatggcttg 
acccccgtga gtgggatgaa aggggggagt 
ccggtcaagt ccgaccaaaa ggccaccgcc 
ctcaacagtt atggggatga tggggtgacc 
tttaagttga acgaggagcg cttgccatcg 

35 gtaaacgaaa gtgaactgaa gtccgcacgg 
tcaaacacca aagtcaagtg aaccaacacc 
tacagtgcca atttccccga agcgggtaac 
aagafctagca ccttggaatc gcaagccact 
ggtaccggtc ttaaagccgg tgttgaccca 

40 tatagtgcag tactactagt gcgtggtggc 
gataaactgt tggattctac tgacaaaaac 
tttgggtctg ctgaaagtgc cgtagacctc 
gaaagtggtc tgtggttcta tacctttgac 
caacaacagg tcaaaaaccg caagggttat 

45 gaatttaacg aggacgctaa taccacaacc 
caaaacggga ttgcttccag tcaggacgac 
ttctccgctg tgattaccaa ggatcaaaca 
accaacgggt tgtttgaaaa ggatgatcag 
ggtttggtcc ctatttacaa cgaaggtatc 

50 gccaacagtg ttttgcaagc gcgtaacctc 
aaccccgata tcctccaaag cttctttaag 
atgctagtgg gggaaaagac atcggatact 
ttggatggta acttctatgg tgaggattcc 
ttcccttccc ggatttttgc tggctttgct 

55 gtcggttcat cggtgggcat tctcttaatc 
ccaatgtata aggtccgcaa gcttcaagac 
gatacgttga caaccgctgt gggtagcgtg 
atcaaaaaag ctcctagtgc gttgaaagct 
aaaccagctg ctccaacagc tccaagacca 

60 

<212> Type : DNA 

<2X1> Length : 3657 

SequenceName : SEQ ID 507 
SequenceDe script ion : 

65 

- Sequence 



gatgtcgfcgc gcgccgccca tttacacccc 180 

ggggatacca agttggtggc gctggtccgc 240 

acaacgagtt cggatcaaag taataccaac 3 00 

tcacagaagg cacttaatgg ctcgcagagt 360 

caagacttcg ccagctatgt cctcatcttt 420 

gaacgcaaga ttaagttggc gttgccctac 48 0 

caaggttcca atggtaaggg ctccctctac 540 

cccgtgaccc cttacacccc gaatgcgggg 600 

acggttcatt ttggttcggg tcaagaatcg 660 

cttaaaaaca accccggacc caaagccgtc 720 

taccggaagc tgaatgaaag ttgaccggtg 780 

aaggggaagg atgagagctc ttggaaaaat 840 

ccgttggtgg ggatggttgg aagtggtgcg 900 

ggctcgaaca gtfccggggtt aaaatcgctc 960 

agcagtacaa gtaatcaaac tttaagctta 1020 

gttgtaagcc aacccgcggg gggtgctacg 1080 

gacaccgcca cctttagcaa gtacctcaac 1140 

attgttccgg ggttggaaaa atgaggtggt 1200 

caggatgcta cttccactaa cctgccccat 1260 

actggtfccgc cccgcgaacc agctttaacc 13 20 

gctggccccc ttcgtgcggg caatagcagt 13 8 0 

cagctctatc atacttcaac cgcccaactc 1440 

ggttccga.cc gggtaccgag tctttggtat 1500 

aaagcgacct gatgagcgaa aaccgagctc 1560 

gtcgaaaa.cc agttggggtt taaagatgac 162 0 

aaggcccaag gcctcaccca acccgcctac 168 0 

cacctcgtct ttgcggcctt taaagcgggc 1740 

agcgcttcga cctacaacca agcactcgcc 1800 

ggggggtaca aggccttggt ggaaaacacg 18 60 

tttaccctgc tcgacacctt tgcgtatgtg 1920 

cagaataatg aagaagtgca aacgacttac 1980 

aaaattgcct ccttaattaa tgccagccca 2040 

gtgtttcjatg ccctgggcct taactttaac 210 0 

cgcacccjacc aactgcttgt gtatgggatt 2160 

gaaaatgccc agtcgacctc cgatgataat 2220 

gcctcgcact acctccccgt gccgtattac 2280 

agaaggcgag cggagcagcg gaatggggtg 2340 

gatggctttg ccaactcgtt acttaacttt 2400 

gctccagtag cacggggtca taaaccgaac 2460 

gttgtaaggt taaactttaa ccccgatact 2520 

agtgaaccta tctccttctc ctatacccca 258 0 

accacgttga aggatgtgac ctatattgct 2640 

aatggtgaaa aaccaacgta cgatggtaaa 2700 

gctgtgatta ccgtatcacg taccggaatt 2760 

ttaagccaag ccccagctgc tttggctgtc 282 0 

ctcacaggca tcctaccgtt atccgatgag 28 8 0 

tggaccgrgta aggttgatat ctataagaac 2940 

ctatcgcjaaa acgtgaagag gcgtgacaac 3000 

gtcgatattt ggggcagagt ggattttgct 3 0 60 

actgataaaa cggttgatga ggtgatcaat 3120 

tttaccccag cctttgataa ccaaagagca 3180 

accttaacgg ttaaaccgaa gattgagtac 3240 

aagattgctg gaattccgct caacattgat 33 0 0 

gctttaccgt cctgggtcat tccggtatca 3360 

ctgctca.tct taggccttgg tattggaatt 3420 

tccagctttg ttgatgtgtt taaaaaggtg 3480 

tacaagaaga ttatcaccca aacgagtgtg 3540 

gctaata.acg ctgctcctaa agcaccagtt 3 600 

ccagtccaac cacctaaaaa ggcttaa 3657 
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<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgcaccaaa ccaaaaaaac tgccttgtcc aagtccactt ggafctctcat cctcaccgcc 60 

accgcctccc tcgcgacggg actcaccgta gtgggacact tcacaagtac caccacgacg 12 0 

5 ctcaagcgcc agcaatttag ctacacccgc cctgacgagg tcgcgctgcg ccacaccaat 180 

gccatcaacc cgcgcttaac cccgtgaacg tatcgtaaca cgagcttttc ctccctcccc 240 

ctcacgggtg aaaatcccgg ggcgtgggcc ttagtgcgcg acaacagcgc taagggcatc 300 

actgccggca gtggcagtca acaaaccacg tatgatccca cccgaaccga agcggctttg 360 

accgcatcaa ccacctttgc gttacgccgg tatgacctcg ccgggcgcgc cttatacgac 420 

10 ctcgattttt cgaagttaaa cccgcaaacg cccacgcgcg accaaaccgg gcagatcacc 480 

tttaacccct ttggcggctt tggtttgagt ggggctgcac cccaacagtg aaacgaggtc 540 

aaaaacaagg tccccgtcga ggtggcgcaa gacccctcca atccctaccg gtttgccgtt 60 0 

ttactcgtgc cgcgcagcgt ggtgtactat gagcagttgc aaagggggtt gggcttacca 660 
cagcagcgaa ccgagagtgg tcaaaatact tccaccaccg ggy^aatgtt tggcttgaag , 720 

15 gtgaagaacg ccgaggcgga caccgcgaag agcaatgaaa aactccaggg cgctgaggcc 78 0 

actggttctt caaccacatc tggatctggc caatccaccc aacgtggggg ttcgtcaggg 840 

gacaccaaag tcaaggcttt aaaaatagag gtgaaaaaga aatcggactc ggaggacaat 90 0 

ggtcagctgc agttagaaaa aaatgatctc gccaacgctc ccattaagcg gagcgaggag 960 

tcgggtcagt ccgtccaact caaggcggac gattttggta ctgccctttc cagttcggga 102 0 

20 tcaggcggca actccaatcc cggttccccc accccctgaa ggccgtggct tgcgactgag 10 8 0 

caaattcaca aggacctccc caaatgatcc gcctcgatcc tgattctgta cgatgcgcct 114 0 

tatgcgcgca accgtaccgc cattgaccgc gttgatcact tggatcccaa ggccatgacc 12 0 0 

gcgaactatc cgcccagttg aagaacgccc aagtgaaacc accacggttt gtgggactga 12 60 

aaggcgcgcg atgttttgct ccaaaccacc gggttcttca acccgcgccg ccaccccgag 13 2 0 

25 tggtttgatg gcgggcagac ggtcgcggat aacgaaaaga ccgggtttga tgtggataac 13 80 

tctgaaaaca ccaagcaggg ctttcaaaag gaagctgact ccgacaagtc ggccccgatc 144 0 

gccctcccgt ttgaagcgta cttcgccaac attggcaacc tcacctggtt cgggcaagcg 15 0 0 

cttttggtgt ttggtggcaa tggccatgtt accaagtcgg cccacaccgc gcctttgagt 15 60 

ataggtgtct ttagggtgcg ctataatgca actggtacca gtgctactgt aactggttga 1620 

30 ccatatgcct tactgtfcctc aggcatggtc aacaaacaaa ctgacgggtt aaaggatcta 16 80 

ccctttaaca ataaccgctg gtttgaatat gtaccacgga tggcagttgc tggcgctaag 1740 

ttcgttggta gggaactcgt tttagcgggt accattacca tgggtgatac cgctaccgta 18 00 

cctcgcttac tgtacgatga acttgaaagc aacctgaact tagtagcgca aggccaaggt 1860 

cttttacgcg aagacttgca actcttcaca ccctacggat gagccaatcg tccggattta 1920 

35 ccaatcgggg cttgaagtag tagtagtagt agtagtcaca acgcacccta ctacttccac 1980 

aataaccccg attgacaaga ccgtccaatc caaaatgtgg ttgatgcctt tattaagccc 2040 

tgagaggaca agaacggtaa ggatgatgcc aaatacatct acccttaccg ttacagtggc 2100 

atgtgagctt gacaggtata caactggtcc aataagctca ctgaccaacc attaagtgct 2160 

gactttgtca atgagaatgc ttaccaacca aactccttgt ttgctgctat tctcaatccg 2220 

40 gaattgttag cagcfccttcc cgacaaggtt aaatacggta aggaaaacga gtttgctgct 2280 

aacgagtacg agcgctttaa ccagaagtta acggtagctc ctacccaagg aacaaactga 2340 

tcccacttct cccccacgct ttcccgtttc tccaccgggt tcaaccttgt ggggtcggtg 2400 

ctcgaccagg tgttggatta tgtgccctgg attgggaatg ggtacaggta tggcaataac 2460 

caccggggcg tggatgatat aaccgcgcct caaaccagcg cggggtcgtc cagcggaatt 2520 

45 agtacgaaca caagtggttc gcgttccttt ctcccgacgt tttccaacat cggcgtcggc 2580 

ctcaaagcga atgtccaagc caccctcggg ggcagtcaga cgatgattac aggcggttcg 2640 

cctcgaagaa ccctcgacca agccaacctc cagctctgaa cgggggcggg gtgaaggaat 2700 

gataaggctt caagtggaca aagtgacgaa aaccacacca agttcacgag cgctacgggg 2760 

atggaccagc agggacaatc aggtacctcc gcggggaatc ccgactcgtt aaagcaggat 282 0 

50 aatattagta agagtgggga tagtttaacc acgcaggacg gcaatgcgat cgatcaacaa 2 88 0 

gaggccacca actacaccaa cctccccccc aacctcaccc ccaccgctga ttgaccgaac 294 0 

gcgctgtcat tcaccaacaa gaacaacgcg cagcgcgccc agctcttcct ccgcggcttg 3 00 0 

ttgggcagca tcccggtgtt ggtgaatcga agtgggtccg afctccaacaa attccaagcc 3060 

accgaccaaa aatggtccta caccgactta cattcggacc aaaccaaact gaacctcccc 312 0 

55 gcttacggtg aggtgaatgg gttgttgaat ccggcgttgg tggaaaccta ttttgggaac 318 0 

acgcgagcgg gtggttcggg gtccaacacg accagttcac ccggtatcgg ttttaaaatt 324 0 

cccgaacaaa ataatgattc caaagccacc ctgatcaccc ccgggttggc ttgaacgccc 33 0 0 

caggacgtcg gtaacctcgt tgtcagtggc accacggtga gcttccagct cggcgggtgg 3360 

ctggtcacct tcacggactt tgtcaaaccc cgcgcgggtt acctcggtct ccagttaacg 3420 

60 ggcttggatg caagtgatgc gacgcagcgc gccctcattt gggccccccg gccctgagcg 3480 

gcctttcgtg gcagttgggt caaccggttg ggccgcgtgg agagtgtgtg ggatttgaag 3540 

ggggtgtggg cggatcaagc tcagtccgac tcgcaaggat ctaccaccac cgcaacaagg 3600 

aacgccttac cggagcaccc gaatgctttg gcctttcagg tgagtgtggt ggaagcgagt 3660 

gcttacaagc caaacacgag ctccggccaa acccaatcca ctaacagttc cccctacctg 3720 

65 cacttggtga agcctaagaa agttacccaa tccgacaagt tagacgacga tcttaaaaac 3780 

ctgttggacc ccaaccaggt tcgcaccaag ctgcgccaaa gctttggtac agaccattcc 3840 

acccagcccc agccccaatc gctcaaaaca acgacaccgg tatttgggac gagtagtggt 3900 
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aacctcagta gtgtgcttag tggtgggggt 
tctggcgtgg atctctcccc cgttgaaaaa 
agcacgagtg acggaaacac ctcctccacc 
aatgatgtgg tgggggttgg tcgactttct 
5 gttgatggta ttgtacgcac cccactcgct 
gacactggtc cacaaagcgt gaagttcaag 
tttacccacc cagtcaccga tctgtttgat 
ataccgctgt ttattgatat cccagcaagt 
ttgagctttg acaccaacga acagagctta 

10 caagataccc aaccaaacaa caacgttcag 
ctgttaacgg cctccagtca aggtccccaa 
gattacgtgt tgccgttagc gatcactgta 
ttaggacttg ccattggaat cccaatgcac 
gcgctatcaa accaaaaggt tgafcgtgttg 

15 atcattaacc gcacaggtat cagtcaagcg 
aaaccaggag caccccgccc accagtacca 
caacgaccta aaaaacccgc ttag 
<212> Type : DNA 
<211> Length : 4884 

20 SequenceName : SEQ ID 508 

SequenceDescription : 



gctggagggg gttcttcagg ctcaggtcaa 3960 

gtgagtgggt ggcttgtggg gcagttacca 4020 

aacaacctcg cgcctaatac taatacgggg 4080 

gaaagcaacg ccgcaaagat: gaatgacgat 4140 

gaactgttag atggggaagg- acaaacagct 42 00 

tctcctgacc aaattgactt caaccgcttg 42 60 

ccggtaacta tgttggtgta. tgaccagtac 4320 

gtgaacccta aaatggttccg tttaaaggtc 43 80 

ggtctccgct tagagttctt taaacctgat 4440 

gtcaatccga ataacggtgst cttcttacca 45 00 

accttgttta gtccgtttaa. ccagtgacct 45 60 

cctattgttg tgattgtgc t cagtgttacc 462 0 

aagaacaaac aggccttgaa. ggctgggttt 4680 

accaaagcgg ttggtagtgt ctttaaggaa ,,4740 

ccaaaacgct tgaaacaaao cagtgcggct 48 0 0 

ccaaagccag gggctcctaa. gccaccagtg 48 60 

4 8 84 



Sequence 



25 



30 



35 



40 



45 



50 



55 



60 



65 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgggttata agttaaagcg atgaccttta gttgcgttta 
ggggtggtgc tagcggcgtg ttctgcactc aatacctcca 
cgctccaagc agttgattgg gtttaccgaa aacaacatca 
aaagccgctt tagcfcgagga caatggtacg gaaaccattt 
gcgttaaaga gttggtacca aaacaataag gatcgcaaca 
tttagtgaaa acgtggagga tgaacatgat aacctgttgg 
ccgattaatt gaccaataga gctccaaaag gaatacgacc 
tcctgaaagg ccttaaagtt atacgaccgt ttaatagccg 
agcaacattg ttgctaatgt gcagttgact gatggtagtg 
aaagataacc tcgacagtac tagcaacaaa atcaagtttg 
cctaacgggg agttcfcttgc taaccfcccag gcttatctat 
gaaaaccccc tccccctcac ccaagccttc ttcgcgtacc 
gacagcttat acgaccaagc cgcgattggt agtgccttac 
gcgtttcgcg agcccaataa fcggacaaagc caaggtaaaa 
aacagtgccc aaaacttcgg ggatttcatt aaggcggtgt 
caaacccaac aatccaatac atcctcccgc accgggttgt 
aacaccaatg gcgctgccaa taagctcttg gtgaccaagt 
aagggcgtgg ggttagctac ggccattatt gaccagtacg 
aaaacgagtt cgttgcccga ggtcaaggta gattcgaata 
gatagttttt ttatggaggg gaaggatgcg gtggcgatta 
aaaattgcca tgaccgacca aacccccggg tttaaggtca 
aagcagagtc agcagaacga taccttttac caaaaccagc 
agtggggata ataactccca gggtaaacac cactacctcc 
agtagtcaag caatggcggc agcttctact ggggcagatt 
ggtggtagct ccgggggcaa ctctgtttta attccgctcc 
cacacccagc aacaagttca acaaaccacc tccacactcc 
ggtgacgatg gtacatacgc cctggcaatt gacggtggtg 
aaacgcgatt tcaccaaaca agccgacatc ttgctgtacc 
aataacttta aggaaaacgg cgtcgagttt agtttgaact 
ttccaaacgt gagcacaaac ggggttaacc gcgaagttgt 
atgggttcgg gtcaggggac ccaagttaag ggcagtgtgc 
tctgttagcg ttcaaactac ccagcaaaac agacagcaat 
gaagtggtta agttagccaa gtccctgctg aaatcaagtg 
actgataacc ccacctttaa aaaggcctta accgacatcc 
ttggctgctg cggggaagtt aagtgagttt aagaaagatt 
cagcaagcta ttattgaccg cgctgataag tacattcaac 
agtgcgattg gcttgggcca accgctccct taccaacgcg 



gctttagaga agttctttat 
gaatcaggat ctgccgcttt 



tccggaagat agtgcagcag 
agtaacacta aagacaacag 



accgttaaac aaccggacat caaaccgacc cgggaaaaca 
ttaactagtg atgtagaaac taaagcatct agtttaatca 



catttaccgg 

atctgtttcc 

ttaaacctgja 

taagggfctaa 

ttgctacccrcj 

accaaaaaca 

agtggggtgg 

acttccaatc 

atcagtttaa 

ttaattccaa 

ttgcccagfcg 

aagcacccaa 

agttgggcfca 

ccaccttcga 

ttccggagca 

ttgactggca 

ccaacctgcg 

agtacctgcjt 

agtccaacca 

gatccafcag-t 

acccggcgtit 

gcaaactgag 

aagacgcggjfc 

ctagtagtcgg 

caaggagtgrc 

aaacccctgt 

attatttttt 

gctatttaca 

tactggaatc 

acggtgctfct 

aggggagtag 

cgactgatac 

cggattfcggc 

aatccgagfca 

tgggggaggt 

tagaaaaaca 

ctagtgatgg 

atggaaaacgt 

atagtcaaaa 

acgacaaaaa 

ccaaatga^g 



aattggccta 

ccgtcaaaac 

agctgtacta 

ctttggtgaa 

tttaactatc 

acaggccgaa 

gagtgaaagt 

acttatcttt 

acccactacc 

accaaatgat 

agtagtggag 

ggacgggctc 

tgccttcccc 

ccccaccccc 

gaagaafcggt 

aaccaagtga 

cggggccttt 

gggtggaagt 

aaatccgctg 

ttcgcgtgcg 

tgtcaaggtg 

cggcgggcaa 

gcggttgacg 

taccaatgtt 

cgcccttacc 

ttacgcccgt 

ggcgaacaac 

agctaagagt 

tggcagcctg 

ggtggcgatg 

ccgagcagct 

ccaagaatcg 

caaacccttc 

caaggattat 

gagtggctta 

agcccaaaag 

tagttacccc 

aaaagcatct 

aagcacaaac 

gttaaagcag 

tgcaacaccc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 " 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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caaattggta gtcagttctc cgaaattgtt 

accaacatga tcttggcctt attgtctgat 

tcctttaagg agtggttctt tactaatacc 

aagaaagaac ttaaaggcaa tgagtacaag 

5 tacttacgtt cctgacaacg actcacttcc 

ggtagtgtaa aagcacaagc agctcaatca 

gttgcaaatg ctgtagcttc aagtggcatg 

ctcggtaaaa aagcgtttga aagtgaactg 

tacctgcgct tcttgtccac cttaatgtga 

10 cgcttgctcc aacaagcgat taccgtgggt 

tatgatgata cggctacggc atcagcagcc 

acagctcagg caacaaatac tcaaagcgac 

gattatgtcc agggttcgga aaccaactga 

gattcgt.tac tggaaagcga aagcacttac 

15 acgaaaagtc aaaagaggtc tactggtggt 

aacggtttaa cgattaactc gccccaaagt 

caaatcttta acaactttgg tcagttagta 

cagtacaagg ataaagcaac cctgaaacgc 

ctcaatgcct ttggtgaagt gttgcaccgt 

20 cgttttaact ccagtgggga accgttgatt 

gatgtagtgg acaaactgga tgatgtttac 

accaaggtga agatgtctga ttcaagtagt 

ccaaagccgc accactcacc aaggactcga 

ttaccaacga ggacgttaac aaaatttctg 

25 tag 

<212> Type : DNA 
<211> Length : 3903 

SequenceName : SEQ ID 509 
SequenceDescriptiorx : 

30 

Sequence 



tcgctcaaga gtaaggacaa taagccccag 252 0 

gttgggatta agtgaaccaa gattctcaat 258 0 

aacgacttta agaacaatta cgattctgaa 2640 

gactttaacg acctcgttaa acaaaccttg 2700 

aaggaaaagt fctggttatta caaggagttg 2760 

ggaatggttt cactttcaag tagcgcagca 2 820 

caaaaatctg gtgatcaaac cctcttggaa 2880 

gaagcatcta gtagtgatgg gcagtacaag 2940 

ctagttaagg atggggcgaa gaactataag 3000 

acccgtgcct fctgtgtcgtg aacggtgagc 30 60 

gcagcaaaag cgcaagtagc agtattaaag 3120 

aaccccttca ataagtttgfc ccaaaacccc 318 0 

tttaacgata agagtacgcc aattaaaccc 3240 

aactttaccg cggaaccgtt .tgatgataag . .3 3 00 

accactaacg aaaaacactt ctttggcttt 33 60 

gtgtcgactg ctagtgccgg actcacggaa 3420 

acaagtagtg acaaatccgg ggccttgagc 34 8 0 

ttaatccaaa acactaattc cgatgccgag 3 54 0 

gcagttaatg fccgataccag taacttagga 3 600 

agctttgaca acaagaagaa gttcttggtt 3660 

tttaacaagt tcgaaggtta tgttggccaa 372 0 

agttcccaag cgtacgaagac tataagaaag 3780 

gtgtcaagac tttgagctat gtcattcaga 3 84 0 

ctagttgaaa aactgataag aacagtgctt 3 900 

3903 



<213> OrganismName : Mycoplasma ] 
<400> PreSequenceString r 

35 atgaaaaaac ttctaataaa accccagttt 
tcaagtgtta ttttagttgc ctgtgctacc 
aaagcacgtt cgaaccagtt cttcaacggg 
actgctttaa aagatccaga ggccaacaaa 
ttgacagctt ggtatgaaaa taaccaagac 

40 aaaaagagtg ttgatgagca atacaaccaa 
aacaaaaacc tctttgtcca acaagacttg 
aaaagcccag aagttgtttg aacagctcat 
<212> Type : DNA 
<211> Length : 453 

45 SequenceName : SEQ ID 510 

SequenceDescription : 



tgattcttaa cccttggtgg ctttatcfcct 60 

ccttccaatt cagcactcca aactgttttt 12 0 

gaacagggta gtttacaaaa cgctttagcc 180 

cagtttgtcg ctgctcccct tttaaaggca 240 

aaacaggtca cccagttctt taaagacacc 3 00 

gccgtagata aggtggtatc ggctagccga 360 

ctggatagcg ccgggggggt tagaaacttg 42 0 

taa 453 



Sequence 



50 <213> OrganismName : Mycoplasma ] 
<4 00> PreSequenceString : 
atgcaacagc aaggagaaac taaagatcaa 
aacagtgttg gcgtatcagt tttaggactt 
agcggaggta gcaatggcgg ttctagtagt 

55 aaattcttaa agttccgtgc ttttcaagcc 
gcctttagtt ttcccctaaa cgaaacttta 
atcttagcga acgccttagt taaagttact 
ttagtggatg cttttagttc ttataaaaac 
gctaccacca tgattagctt ttattttgac 

60 gaacgagtac ggagcttaaa ccaaaatgtt 
gggttatcgg ctaaactacc ttacgttaac 
tactttacct ttttaattac caaagttttg 
gtcagtgaag aaaaaagcaa actcaaaact 
aagattttga acaacatcga cagtaagctc 

65 ttagcacccc gaccagctta cagtaacgta 
tggtctgctg gcgctaattg aagtttggca 
ctttcgttta tgttgctcaa acaaatgttt 



tataacactt ttggcctgag acttgtgcgt 60 

gatggttttg ttaaattcat aaaaggcggt 120 

gccaaaaaga ttgataaaga agagcaaaag 180 

aaaattggca cattttataa cactaacttt 240 

aaaggttggt ttgacaaaca ccgcggattg 3 00 

ttagacacaa aggaaaaagc aagtaaggcc 3 60 

tgattgagtg aatacacccc ggttggttta 420 

caaatgaaag ctctcaacaa taagctgtta 480 

aaccaagcca atcctacccc ttggttaaat 540 

actaacggta attacgaaaa gttaaacaac 600 

tgacctaagg tgggcactga ggatactaat 660 

aaaactgaag atgttaacaa gattagggaa 720 

aaaacttttg tccaaaaact caaacctacc 780 

atcttgttaa acattaacaa tgataaggtg 840 

gttttactag atccgaaaaa ggttaacccg 900 

gatcaaaaca gtttgtttaa aaaagcaaag 960 
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actttattcg aaaacattca aaataaagca aaaactagtg gaagtggtaa aagtgg-taca 102 0 

accaccaacg atgatgccga tgctttgagc aaagtcattg gcaactatta ctacaacact 108 0 

tgagctaagc taaccgataa atcaatttat ggcaacctta aggatgacaa atttga.tgat 114 0 

ctctttaaat tggcttttga cagtagtatt aacgaaaagt cctttaatgt agatta_taag 120 0 

gcagtgattg aacactaccg ctttatctat accttagagt ggttggtaga caaaaaccta 1260 

aagaacttca aggatttatt aaaggcaaac ctcaagtttg gtgaaattgc ttttafcfcgct 132 0 

tacaagaata ctgaaacgca gaacttctct aacccgcaag gtatattcgg ttctta_cttt 13 8 0 

aattacgaaa acgaaacgaa tgcggctaag agtgctacgc aaattataga ccccaacagt 144 0 

ttcttctata aaaccaccac taaaccagaa gcgaaaacca ctcaaagtgc taatacragct 15 0 0 

gtgatggttc aaaatacgca gatgaacaat cagcaaacta acagttatgg ctttacfcggt 1560 

ttgagtacca gtagtggttc gatgttaggt gctgctaccc agcaagccat tttgga_tcaa 162 0 

ataaccaaaa cttccttgca acagtatggt tctcaagctg acctgaaaaa gatcattggc 168 0 

gaaactaaaa atcaattatt attagaccga attgccaacc aactaatagc cttaaaaccg 174 0 

aatacaagtg. gcaacagtgg tacccaaaaa acaattgctg cafeacfctcca aacagatgcg 1800 

gttggcaatc ctactttgga ctttaaagcg aagcaaaaac tcttattgga tgttttagat 18 60 

caatacaaag atttctttgg taataatgca caagcagttc aaagagattc tggtaa.gagt 192 0 

ggaactggca actatttaac ctataccgac ggtagtgata agatcactta tttgca-gttt 198 0 

tcctataaag atattgacgg tttaagtttg agtagttcaa acggaactag cagcaa.attt 2 04 0 

gccagtgatg ttgtagcagc gcttttatta ttccaggcag cctataaagg tactcaacaa 210 0 

ctggctttaa gttccatcaa taaaccacaa ttaccaattg gcgataaacg cataaaaaca 2160 

gggatcgatt tactgaaata g 2181 
<212> Type : DNA 
<211> Length : 2181 

SequenceName : SEQ ID 511 

SequenceDe script ion : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgaagagtt ttttaagaaa acccaagttt tgacttttat tgttgggtgg cctttccact 60 

agtagtatta ttctcagtgc ctgtgcaaca ccttcaaact cagcactgca agcagttttt 120 

aaaccaactt ccaaccaatt ttttaacggt gagcatggta ccattcaaag cgctttaaac 180 

accgctttaa gagatccgga aactaacaaa aaatttgtag ctgctccact tttaaaagca 240 

ttagaagctt ggtacgagaa taatcaagat aaaaacatta ctcaattctt aaaagacacc 3 00 

aagactaatg ttgataacca atacaaaacc gttgtagata aagtagtttc agcaccacgc 360 

aataaatcct tatttgtaca acaagattta ttggacagta gcggtggtag tgaggcrcact 420 

tgaaaagcgc gcaaactgtt tgaacagcta attagtgatt ttgcttcacg agtttfctcaa 480 

aagaattatt tgtcttataa ggaaaatggc aaagtatcag ccggtccgtt tttatacgac 540 

acgatttcaa aaaatagtaa ttgacagaat atagtttttg atgctgttaa ttttccragaa 600 

actaatgatg attttttcgc gaagattcaa agcgaagttt ttgatcaatg ggctgagtac 660 

accgatccaa ctattattag ctcagttacc ttaaagtatt cagctcctaa ttaa 714 

<212> Type : DNA 
<211> Length : 714 

SequenceName : SEQ ID 512 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<4 00> PreSequenceString : 

atgattaact tcttgtttaa ccaaatgaac gcgttgaaca ataagttttt ggagcgcgct 60 

aaagcattaa accaaaatgt gaatcaggcc aatcctacac cttggttaaa tggtttatca 120 

gctaaattgc cttatgttag aacaaatggt aattacgaaa aactcaacaa ttacttcact 180 

ttcttaattg ttaaatacat gtggaagaag gtcggtaatg aagacgcttc attatctaaa 240 

gatagtagca ttaacaagct caaaactaag accgaagacg ttaacaaaat tagagataaa 3 00 

atcttagaag acattcaaaa gaaggttcaa gaatttgtta aaaacaagct taaaccaacc 3 60 

ttagcaccac gacaaactta cagtaatgta attttgttaa acgttaacaa cgataaggtt 420 

tgatcgatgg gcgccaattg agctttagct aacttattgg atacgagcaa aattaatcca 480 

ctttcattta tgttgctcaa acaaacgttt gaccaaaacg acttgtttaa gaaagctaag 540 

aaactttttg aagatattca aagtaaaaca aatggtggaa gttcaggtgg gatgcasiggt 600 

agcaatacct cgagtagtga aggagctgat gccttgagta aagtaattgg caactattac 660 

tacaacagtt gagctaaatt gactgataaa tccatttatg gaaatcctaa ggacaacaaa 720 

tttgatgact tatttaaatt agcttttgaa gatagtatta acgaaaagtc ctttaatgtt 780 

gattacaaag cggtcattga acactaccgc tttatctata ccttggaatg actagttaac 840 

ggtaacttaa agaactttaa ggacttatta aaagcaaatt taaagtttgg tgaaattgct 900 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



ttcattgctt ataaaaacac tgaaaccaaa 
tctgctttta attacgaaaa cgaaactaat 
ccaaataatt tcttttataa aacaactacc 
ggtgcaagca tgatggttat gcaacaaaaa 
tatggtttta ccgggctaaa caccagtact 
gccattttgg atcaaataac caagaattcc 
aaaactttga ttgaaaagac taataaccaa 
agtggattaa atccttcaac cactggaaat 
acttatttcc aattagatgc cattggcaac 
cttttattag atgttttaga tcagtacaaa 
caaagagatt ctggtaaagg tggacatgga 
aagatcactt accttcagtt ctcctataaa 
ggaaatagta agcttgctag cgatgttgtg 
aaaggtaccc aacaattagc cttgagtgct 
<212> Type : DNA 
<211> Length : 1719 

SequenceName .- SEQ ID 513 
SequenceDescription : 

Sequence 



gaattttcca 
gaagttaaga 
aaaccagaag 
atgcaaagta 
agttcaatgt 
ttgcaacaat 
cttttattag 
agtaataatg 
ccaaccctta 
gatttctttg 
agctattcaa 
gatatagata 
gctgcccttt 
attaattaa 



atccgcaagg 
ttgctgcgca 
aagttaaaac 
ccatgcaaga 
taggtgccgc 
atggttcgca 
accgaatcgc 
gaaagggtaa 
gctttcaaca 
gtacaaatac 
cttaccaaga 
acttaagttt 
tacttttcca 



tgtattfcggt 
aaacttagac 
cgcacaaaat 
tagtaaccat 
tacgcagcaa 
gcaagaactg 
tagtcaatta 
aaatattgct 
aaaacgaaaa 
tcaggctgca 
tggcagt<gac 
gagtgataaa 
agcagcggat 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgaaaaagt ttctaagaaa accccaattt tgactcttaa 
actagtgtta ttttagctgc ttgtgctacc ccttccaact 
aaagcgcgtt ccagtcagtt tttcaatggt gaacagggta 
acggctttaa aaaatccagt ggccaacaaa caatttatcg 
ctagaagctt ggtacgaaaa caacgaagac aaaaagatta 
aagtccaatg ttgacagtca gtacacaacg gcagtcgata 
aataaatcac tttttgtgca acaagatttg ttggataacg 
tggaaagcgc aaaagctgct tgaacagctc attagtgact 
aagaactacc tcaattacaa aaaagatgga caagtttcta 
gaactacaca aggaagaaag ctgaaaaaac tttgaattta 
actaatgatg actttttcgc caaaattcaa agccaagtat 
actgatccca ctttaattag tcaagttaac tataagtatt 
ggtcagatct ataacagaga gaagttaaag gataaactaa 
ttctttgccg aagaaaaaga cattgcaccc aaccaaaacg 
cagttagtta aaggtgaggg cgctattact gataataata 
agccaaaaaa ctggtctgct caaataccgt aatgaatcaa 
tttcccttaa atttatcaga tactaacgaa acgaagcaat 
gtcgatcagt tagaagctgc taacttaggt gcagctttaa 
gagcaagata atgacgaatt gccgcaaatt aaggagctca 
attgttgtcg ataaaagtaa ggacgtagaa aaagcttcca 
aatgatcaag aaggtaagca acaacaaagt gactcagatc 
gacatttttg ctcaaaacac aagcgaaggc actaacttaa 
aagaaagcag ctgcaacaaa aatggaagcg aaaacagctg 
aagggccaac aaaacaatta cgttgtttta gatgcagcta 
acatcaaaat cgaaaaataa tagcgcttct aatgaagttt 
tctataaatt taaggcaggt tcagcaaact gatcaaaaca 
cggattgtgc gtaacagcac cggagtaact gttttcggcc 
ttaaaacaag attcaacaaa taaaaaatca gtttctaagc 
aaatctagtt caggtaacag taataaagta ttaagagacc 
ttaaagtttc gtgcctttca agctaaaact aacactttct 
agttttccct taaacgaaac gctaaaaagt tggtttgata 
gccaacgcct tagttaacgc tagtcttgac caaaaggata 
gaagctttta atccttataa agagttaatt aaagaatttg 
acaatgatca gcttttattt tgatcaaatg aaagcgctca 
gcccgtaatc ttaaccaaaa cgtcaaccaa gccaatccaa 
tcagccaagt taccttacgt taatactaac ggtaattacg 
accttcttaa tcacaaaaac attgtgacct aaagttggtc 
gaagaaagca ataagctcaa aactaaaact gctgacgttg 
ttggaaaaca tccaaactaa agtaaatgat tttgttaaaa 
gcaccccgac cagcttacag taacgtcatt ttgttaaacg 
tctagtggtg ctaactggag cttagctagc ttattacaga 
tcgtttatgt tgctcaaaca agcgttcgat aataacgatt 
ttgtttaaag acattcagga aaagtccagt aataatggtg 
accaacagtg atgccgatgc attaagtaaa gttatcggta 



cattaggcgg 
cggctttaca 
gtttacaaag 
ctgcaccact 
cccagttttt 
aagtggtatc 
ccggtggtag 
ttgctagtcg 
ctggtccatt 
gtgccccacg 
ttgaccaatg 
ctgctcccag 
caccttctta 
ttggtaataa 
tcggtcaaag 
ataagggtga 
tagtagacgc 
atttaaaact 
aagaagacct 
aaactaacgc 
caattgctgg 
gtaagttagc 
ttttaagaac 
ttcctacatt 
tagttgcctt 
gttacagtcc 
ttgatggtgg 
aaagtttaac 
ttgacaagca 
acagtactaa 
aacaccggga 
aagctagcaa 
cgccggtggc 
ataacaagtt 
ccccttggtt 
aaaagttaaa 
aagaagaaac 
acaagattag 
acaaactcaa 
ttaacaatga 
gcgacaaagt 
tatttaagaa 
gaatgcaaag 
actattacta 



ttttctgtct 
aaccgttttt 
tgccctaaca 
tttaaaagca 
aaaagacact 
agcatcacgc 
tgaagcaacc 
ggttttccaa 
tacttatgat 
ttttagfc<gaa 
ggtggagtac 
tcaaggg-tta 
tgcctttcct 
gcgctgaaag 
cggtacaaac 
ttttcttgat 
ttctaacatt 
gcaagttttt 
taacaacacg 
actgttttac 
cgctttacjat 
ggagcag-gta 
taacaattct 
taattcaaca 
aaaatctgga 
cattaaattc 
gagctattat 
cttattaact 
aaaacaattc 
ctttgccttt 
actaattttg 
agctttaact 
tttagcaacc 
gctagagcgc 
gaacggtttg 
caactacttt 
aagcataagt 
ggacaaaatc 
acctgct tta 
caaagtactt 
caacccgctt 
agcacaaaag 
tagttccaca 
cactacttgg 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
17 IB 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
138Q 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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gctaaactca cggacaaatc gatttacggt aacccgaagg acaacaagtt tgatgagctt 270 0 

ttcaaactgg cttttgaagc tagcatcgac gaaaagtcct ttaacgttga ttacaaagcg 2760 

gtcattgacc attaccgttt tatctatacc ttacagtggc tagtggacca aaaattaaag 282 0 

aactttaagt cactgttaaa aacaaacctt aagtttggtg aagttgcttt tatagcttac 28 8 0 

5 aagaacactg aaaccactaa cttctctaat ccccaaggcg tatttggttc ctacttcaac 294 0 

tatgagaact ccgctagcga agttaaagaa tccactcaaa cactagatcc caataacttc 3 00 0 

ttttacaaaa caacaaccaa gccaacggta caagccattc aacaagttgc tagcfctagca 3 060 

ttagtacaaa aacaacaaat gcaacaaaat tcaactgatc actatggttt tactggtttg 312 0 

agtaccagca ctagttcgat gttcgatgct agttcccgtg atgccatttt gcagcaaatt 318 0 

10 acgaaaacct ccttgcaaca gtacggttcc aaggatcaac tcaaaaagat cattcaaggg 3 240 

actaataacc aattgttact agaccggatt gcagtccagt taagtggatt aaatccttcc 33 00 

acaactaatg gtggcagtgg caaaacaatt gcgacctact tccaagtgga tgccgttggc 33 60 

aacccaactt tggacttcca agctaagcgt aaactgttat tagacctctt ggaccagtac 342 0 

caaaactact ttggtaatgg tgcccaaaag agccaaaggg rifcte^fcactce aagtggaact 3480 

15 ggcaactacc tcacttacca aaatggtagt gacaagtaca cctacaccca gttcacttac 3 540 

caagacattg acagcttgag tctaacaact acaagcggta ccaacaataa aattgccagt 3 60 0 

gatgttgtgg cagcgctgct tttattccaa gcagcagaca agggcacaca acagttggca 3660 

cttagtgcca ttaataagcc gcagttaaat attggtgata aacggataga aagtgggtta 3720 

aaattgctca aatag 3735 

20 <212> Type : DNA 

<211> Length : 3735 

SequenceName : SEQ ID 514 
SequenceDe script ion ; 

25 Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<4 00> PreSequenceString : 

atggttggaa gtggtgcggc tggaagtgct tctagtttac aaggcaatgg ctcgaacagt 60 

30 tcggggttaa aatcgctctt gagatcagca cctgtcagtg ttccaccaag cagtacaagt 12 0 

aatcaaactt taagcttatc taaccccgct cctgtaggcc cacaagcggt tgtaagccaa 180 

cccgcggggg gtgctacggc agcagtgtcc gtcaatcgca cagcgagtga caccgccacc 240 

tttagcaagt acctcaacac cgcccaggcc ttgcaccaga tgggggtgat tgttcrcgggg 3 00 

ttggaaaaat gaggtggtaa caacggtacg ggtgtagtgg ctagccgacg ggatgctact 3 60 

35 tccactaacc tgccccatgc ggcaggtgct tcccaaacgg gtttgggaac tggtfccgccc 420 

cgcgaaccag ctttaaccgc aacgtcacag cgtgccgtca cggtggttgc tggccccctt 480 

cgtgcgggca atagcagtga aactgatgcc ctaccgaatg tcatcaccca get ct at cat 540 

acttcaaccg cccaactcgc ttacttaaat ggecagateg ttgtgatgag ttccgcccgg 600 

gtaccgagtc tttggtattg agttgtcggg gaggaccagg aatcgggcaa agegacctga 660 

40 tgagcgaaaa ccgagctcaa ctggggcacc gacaagcaga agcagtttgt cgaaaaccag 720 

ttggggttta aagatgactc aaatteggat tccaaaaatt cgaatttgaa gacccaaggc 78 0 

ctcacccaac ccgcctacct catcgccggt cttgacgttg tggccgacca cctccjtcttt 840 

geggcattta aagegggege ggtggggtat gatatgacga ctgattcgaa cgcttcgacc 900 

tacaaccaag cactcgtctg gtcgaccacg gccgggttgg acagtgatgg ggggacaagg 960 

45 ctttggtag 969 
<212> Type : DNA 
<211> Length : 969 

SequenceName : SEQ ID 515 
SequenceDe script ion : 

50 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<4 00> PreSequenceString : 

55 atgccagttt ttctaaaatt aacgcacaca attagaaaag tgctaagagt tgeaagaett 60 

tctagactag cattactatc actaaccgct gttattttta gcggttgtgc caatattaat 120 

ttaattagtg ctgttggttc ttcttcggta caaccgttgt taagcaaact cagttcgcac 180 

tatgtcttga accacaatga caaggataac cttgtagaaa ttagtgtcca agcgggtggc 240 

tctagtgctg gggtgaaagc aatcaccaag ggactagctg acattggtaa tgtctcgaaa 3 00 

60 aacaccaaga getatgetga ggaaaacaag cagttgtgga tggacaaaaa gctcaaaaca 3 60 

attacacttg geaaagatge cattgctgtt atttataaag caccatcaga gtttaagggc 420 

aaactagttc tcactaaaga caacctcaac gatctttacg atctgttcgc tggtagcaaa 48 0 

agegttgaca ttaataagtt tgtcgaaaac ggacaaacca ctaaaaacag taatcataat 540 

ttgataggct tcccccgtac tgggggeget tttgcttctg gtaccgctga agctttcttg 60 0 

65 aagttttcgg gtcttacaca gacaaaaact ttagataaag attccaaaga aattttggaa 660 

ggtcaacgea actatggccc caatgegega ccaactagtg aaaccaacat tgaggccttt 72 0 

aatacctttg tcacaacttt gcgacaaccc aatttatacg gcatggtgta cctcagttta 780 
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gggtttgtga ataacaacat gaacctaatt aagagtgaag gttttgaggt tttaaaagtc 840 
aaatatgata ataacgcagt taccccctcc agtcaagcag tttctagcaa cacttacaaa 900 
tgggtacgcc cgttgaactc agtggtttcc ctgttaccaa aacaaaaaac actgccaagt 960 
atccaacgct tttttaactg attgttattt agcaacaaca gtgaaattaa gaaaatatac 1020 
5 gatgactttg gtgtgttaga gttaacggct gacgaaaaga aaaagatgtt taaaacaggt 1080 
aatgcagaaa tgagcaacat tgccaacttc tcjggttgatg attacagtct gaacaaccaa 1140 
accttcggtg cactctag 1158 
<212> Type : DNA 
<211> Length : 1158 
10 SequenceName : SEQ ID 516 

SequenceDescription : 

Sequence 



15 <213> OrganismName : Mycobacterium, tuberculosis H3 7Rv 
<4 00> PreSequenceString : 

atgagcttcg cggtgctgcc cccggagatc a_attcggcgc gcctgtacgt gggtgccggg 60 

ttggcgccga tgctggacgc ggcggccgct tcjggatggac tggccgacga attgggttcg 12 0 

gccgcggcct cgttttcggc ggtgacggcg g-cjgctggcag gttcctcgtg gctgggcgcg 180 

20 gcgtcgacgg cgatgacggg agcggccgcc ccctatctgg gctggttgag cgcggcggcg 240 

gcgcaggccc agcaggcggc cacccaaacc ccjgctggcgg cggccgcctt cgaggcagcc 3 00 

ctggcggcga cggtacatcc ggcgatcatc tcggccaacc gggcactgtt cgtgtcgctg 3 60 

gtggtctcga acctgctggg ccaaaacgcc crcggcgatcg cggccaccga ggccgcctac 420 

gagcagatgt gggcccagga cgtggcggcg a_tgtttggct accatgccgg ggcttcggcg 480 

25 gccgtctcgg cgttgacacc gttcggccag cjcgctgccga ccgtggcggg cggcggtgcg 540 

ctggtcagcg cggccgcggc tcaggtgacc a_cgcgggtct tccgcaacct gggcttggcg 600 

aacgtcggcg agggcaacgt cggcaacggt a_atgtcggga acttcaatct cggctcggcc 660 

aacatcggca acggcaacat cggcagcggc aacatcggca gctccaacat cgggtttggc 72 0 

aacgtgggtc ctgggttgac cgcagcgctg aacaacatcg gtttcggcaa caccggcagc 780 

30 aacaacatcg ggtttggcaa caccggcagc aacaacatcg gcttcggcaa taccggagac 840 

ggcaaccgag gtatcgggct cacgggtagc gjgtttgttgg ggttcggcgg cctgaactcg 900 

ggcaccggca acatcggtct gttcaactcg ggcaccggaa acgtcggcat cggcaactcg 960 

ggtaccggga actggggcat tggcaactcg ggcaacagct acaacaccgg ttttggcaac 10 20 

tccggcgacg ccaacacggg cttcttcaac tccggaatag ccaacaccgg cgtcggcaac 1080 

35 gccggcaact acaacaccgg tagctacaac crcgggcaaca gcaataccgg cggcttcaac 1140 

atgggccagt acaacacggg ctacctgaac agcggcaact acaacaccgg cttggcaaac 1200 

tccggcaatg tcaacaccgg cgccttcatt actggcaact tcaacaacgg cttcttgtgg 1260 

cgcggcgacc accaaggcct gattttcggg afcxgccccggct tcttcaactc gaccagtgcg 1320 

ccgtcgtcgg gattcttcaa cagcggtgcc cjgtagcgcgt ccggcttcct gaactccggt 13 80 

40 gccaacaatt ctggcttctt caactcttcg tcgggggcca tcggtaactc cggcctggca 1440 

aacgcgggcg tgctggtatc gggcgtgatc aactcgggca acaccgtatc gggtttgttc 1500 

aacatgagcc tggtggccat cacaacgccg g-ccttgatct cgggcttctt caacaccgga 1560 

agcaacatgt cgggattttt cggtggccca crcggtcttca atctcggcct ggcaaaccgg 162 0 

ggcgtcgtga acattctcgg caacgccaac Ettcggcaatt acaacattct cggcagcgga. 1680 

45 aacgtcggtg acttcaacat ccttggcagc cygcaacctcg gcagccaaaa catcttgggc 1740 

agcggcaacg tcggcagctt caatatcggc a_gtggaaaca tcggagtatt caatgtcggt 1800 

tccggaagcc tgggaaacta caacatcgga tccggaaacc tcgggatcta caacatcggt 1860 

tttggaaacg tcggcgacta caacgtcggc ttcgggaacg cgggcgactt caaccaaggc 192 0 

tttgccaaca ccggcaacaa caacatcggg ttcgccaaca ccggcaacaa caacatcggc 1980 

50 atcgggctgt ccggcgacaa ccagcagggc ttcaatattg ctagcggctg gaactcgggc 2040 

accggcaaca gcggcctgtt caattcgggc accaataacg ttggcatctt caacgcgggc 2100 

accggaaacg tcggcatcgc aaactcgggc a_ccgggaact ggggtatcgg gaacccgggt 2160 

accgacaata ccggcatcct caatgctggc agctacaaca cgggcatcct caacgccggc 222 0 

gacttcaaca cgggcttcta caacacgggc agctacaaca ccggcggctt caacgtcggt 2280 

55 aacaccaaca ccggcaactt caacgtgggt gacaccaata ccggcagcta taacccgggt 2340 

gacaccaaca ccggcttctt caatcccggc aacgtcaata ccggcgcttt cgacacgggc 2400 

gacttcaaca atggcttctt ggtggcgggc gataaccagg gccagattgc catcgatctc 2460 

tcggtcacca ctccattcat ccccataaac g-agcagatgg tcattgacgt acacaacgta 2520 

atgaccttcg gcggcaacat gatcacggtc accgaggcct cgaccgtttt cccccaaacc 2580 

60 ttctatctga gcggtttgtt cttcttcggc ccggtcaatc tcagcgcatc cacgctgacc 2640 

gttccgacga tcaccctcac catcggcgga czcgacggtga ccgtccccat cagcattgtc 2700 

ggtgctctgg agagccgcac gattaccttc cztcaagatcg atccggcgcc gggcatcgga 2760 

aattcgacca ccaacccctc gtccggcttc ttcaactcgg gcaccggtgg cacatctggc 2820 

ttccaaaacg tcggcggcgg cagttcaggc gjtctggaaca gtggtttgag cagcgcgata 2880 

65 gggaattcgg gtttccagaa cctcggctcg ctgcagtcag gctgggcgaa cctgggcaac 2940 

tccgtatcgg gctttttcaa caccagtacg cjtgaacctct ccacgccggc caatgtctcg 3000 

ggcctgaaca acatcggcac caacctgtcc cjgcgtgttcc gcggtccgac cgggacgatc 3 060 
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ttcaacgcgg gccttgccaa cctgggccag 
ttcaacctgg gcagcggaaa cgtcggcagc 
tacaatatcg gtccggcgaa cctgggtaac 
tacaacatcg gcttcggcaa cgccggcgat 
5 aacaacatcg gatttgccaa caccggcaac 
aaccagcagg gcttcaattt tgctggcggc 
ttcaactccg gcaccaacaa cgtcggcatc 
ggcaactccg gcagcggcaa caccggcatc 
ttcaacaccg gcatcgtcaa caccggtgtc 
10 tacaacaccg gcgacaccaa caccggcatc 
tacaacaccg gcaatttcag tacgggcttt 
ttcatcaccg gcgacatggg caacggcgcc 
agcgcgggct atcgggtcca tgttcccgaa 
gtcaacatcc ccatcaccgc cagcttcacc 
15 caaatcaact tcggtttcac catcgacatc 
agcaaggccg ttctcccgcc catcaccggg 
gaccctggcg gttcgaccgc gatcaggatc 
gtcacgttcg tcaacattgc ggctaccacg 
tcgggcttct tcaacggcgg ccccggaacc 

20 atttccggct tccagaacgt cgcgaactcc 
ctgcaatcgg gactggcgaa cctgggcgat 
ggggcaccgg ccaacgtctc gggcatgttc 
cacgaccagg cgaccgggat gtcgatgttc 
aacgtcggct tctccaacgt aggcgacagc 

25 aacctcggca gcggcaacct cggcagcttc 
aacatcggcc cggcgaacct gggtaactac 
aacttcggat tcggcaacgc cggcgacttc 
aacatcgggt ttgccaacac cggcaacaac 
cagcagggct tcaattttgc cggtggctgg 

30 aactctggca ccaacaacat cggtttgttc 
aactcgggca ccggaaactg gggcatcgcc 
aacaccggcg acgtcaacac cggcttgctc 
aacaccggcc attacaacac cggcagcttc 
aacccgggta gctacaacac gggttatttg 

35 aactcgggcg atgtcaacac cggcggcttc 
tggcggggcg attaccaagg cctggcgggg 
gccgttccgg tgaaactgca cgtgccgatc 
ggcacgttca ccgttcatgg cttcagattc 
ggcataccgt tcaatgccgc cacactcgat 

40 cttcccaata tcggcatcaa cctgggtagc 
accggcggtc tattgccgat caagattcca 
gggaactcga cgaccacccc gtcgtcggga 
ggcgttggca acgtgggcag caatagttcc 
ggaatctcgg gcgtccagaa cttcggcgag 

45 actgtctccg gccttgtcaa tgcgagcacg 
gg c gg a ggga atgtcggtgc tacggtcgcc 
ctcgggtttg gcaacgtagg cagcgggaat 
gtcggcctcg gaaacctcgg caacgcgaac 
gtcttctccg gaaaccgggg ctcatacaat 

50 atcggcctag gcaacctggg cagctacaac 
ctgggcttcg ccaacagcgg cagcaacaac 
atcggtatcg ggctgtccgg ccacaatcaa 
gccaacaccg gcctgttcaa ctcgggcacc 
ggaaacatcg gcattggcaa ctcgggcatc 

55 ggcaacaccg gcttggggaa ctcgggcacc 
ggcaatatgg gcgtcgccaa cgtgggcacc 
accaacaccg gcatcgccaa cgtgggcatc 
accaacaccg gcagcttcaa cgacggcgac 
tacaacaccg gcttctacaa caccggcgac 

60 ttcagcaacg gcgccttctg gcagagcgat 
atcactgttc cccagatccc gctactgaac 
atccatctcg acttcggtac ccttgccgtc 
cgcgccctcg gggtcaccca cttcagcgtc 
accttaccgg tgatcgatat caacatcggc 

65 acgatcacca gcggcgccgg cccggtcgtc 
ggtttcggaa actcgaccac cggcccctca 
tcgtctggat tcggaaacgt cggcgccaac 



ttgaacatcg gcagcgcaaa tctcggcgac 3120 

ttcaacgtct tctccggaaa ccagggctca 3180 

tacaacatcg gtttcgcgaa cctgggtaac 3 240 

ttcaaccaag gctttgccaa caccggcaac 33 00 

aacaacatcg gcatcgggct gtccggcgac 33 60 

tggaactcag gcaccgcgaa catcggcttg 3420 

ggcaactcgg gcaccggcaa ctggggtatc 3480 

ggcaacaccg gcagcactaa cacgggcttc 3 540 

gccaacgcgg gcagctacaa caccggctgg 3 600 

gccaacctgg gcgacttcaa cacgggcttc 3660 

gccaaccagg gtgatatcgc caccggggct 3720 

ttctggcgcg gcgaccagca gggcctattc 3780 

atacccgcac acgtcaccgt ggaagttccc 3840 

aacaccgtct acagcggcat aacgcttgag 3 90 0 

gcagggatcc ccctgctggc cggtgcaatc 3 960 

accggtcccg cgatcacggt caacatcggc 402 0 

ccggccaccg caagcgtcgg tcccttcgat 408 0 

ggctttttca acgccaccac cgatccgtcc 4140 

gtatcgggca tcgccaacat cggcgccaac 42 00 

gcgacctcgg gcttcaacaa ctacggctcg 42 60 

accgtctcgg gcgtattcaa caccggcatc 432 0 

aacatcggca gcaacctcgc ggggttcttc 43 8 0 

aacctcggcc tgggaaacat cggccaattc 4440 

aacgccggct tggcgaacat cggcagcttc 450 0 

aacgtcttcg gcggaaacca gggctcatac 4560 

aacatcggcc taggcaacct gggcagctac 462 0 

aacctgggct tcgccaacac cggcaacaac 4680 

aacatcggta tcggcctgtc cggcgacaac 4740 

aactccggca gcggcaacag cggcctgttc 4800 

aactcgggca ccggcaacat cggcatcggc 4860 

aacaccggtg acaccaacac cggcatcttc 4920 

aacgccggca acgtcaacac cggcatcttc 4980 

aacgccggca gcttcaacac ggccggcttc 5040 

aacaccggca gctacaacac cggactggcc 5100 

atcaccggca attacagcaa cgggttctgg 5160 

atcagccaaa cgatcaccgt gcccgacacc 5220 

ttcctcgata tcccggtcac cggcacactt 52 8 0 

ccggagatca ccggcgatat" cttcttgatc 5340 

gcattcagtt tcccgaacat ctcgattgtc 5400 

gggccggacc cgctgatcga tatcgccggc 5460 

ctcatcgata taccggcggc cccgggattc 552 0 

ttcttcaacg ccggtaccgg taccgtgtcg 5580 

ggcttcttca acctcacctc tggaagctcg 5640 

ctgatctccg gcgggttcaa cttcggtaac 5700 

cttgggcttt cgatgccggc caatctctcc 5760 

ggcttcgtca acaacaccca gatcctcaac 5820 

gtcggccacg gcaatatcgg cgactccaac 5880 

gtcggccatg gcaacatcgg cagcttcaac 5940 

atcggcccgg cgaacctggg taactacaac 6000 

ttcggattcg gcaacgccgg cgatttcaac 6060 

atcgggtttg ccaacaccgg caacaacaac 6120 

caggggttcg gctcctggaa ctccggtacc 6180 

aacaacatcg gtttgttcaa ctcgggcacc 6240 

ggaaacaccg ggatcggcaa cccgggtgtc 63 00 

ggcaactggg gcctgtggaa cccaggcacc 63 60 

tacaacaccg gtggctacaa cgtgggcagc 6420 

gccaacacgg gcagctacaa caccggcagc 648 0 

ttcaataccg gcttctacaa caccggcgac 6540 

gtgaacaccg gcgccttcat cgggggcaac 6600 

caccaaggcc agtggggcgc acactacgca 6660 

tttagcctca acattccggt caacatcccc 6720 

aacggcttcc agattccggc tatcaccctc 678 0 

ggacccatca tcgttccgag gatcgccggc 6840 

gaccccggcg gttcatcctc gatacccatc 690 0 

atcccgctac tggacatccc gccggccccc 6960 

tcgggcttct tcaactccgg caccggcagc 7020 

aattcgggct tctggaacac cgctttcgcc 7080 
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ggcataggaa actctggctt gcagaacttc ggctcgctgc aatccggctg ggcgaacttg 7140 

ggcaacaccg tctcgggctt ctacaacacc agtgcggcgg acttcgcgac gccggctaac 7200 

ctctcgggac tctccaacgt cggcgccgac ctgaccggcg tgctccgcgg cccgaacggg 72 60 

tcgaccttca acgcgggcct ggcaaacctc ggccaattca acgtgggcag tgcaaacctc 7320 

ggcagtgcaa acctcggcag tgcaaacctc ggcagtgcaa acctcggcaa ttcaaacgtc 73 8 0 

ggcttcggca acattggcaa cgcgaacatc ggcggcgcaa acatcggcga ctttaacgtc 7440 

gggatcgcaa acaccggtcc ggggctaacg gcggctgtca acaacatcgg tatcggcaac 7500 

accggcaact acaacatcgg tgtcggcaac accggtaact acaacatcgg cttcggcaac 7560 

accggcaaca acaacatcgg catcggcctg tccggcgaca accagatcgg gttcggcccg 762 0 

ctgaacgccg gcatcgccaa catgggcctg ttcaacctgg gcgacaacaa ctttggcatg 768 0 

gccaacgcgg gcaacttcaa ccagggcatt gccaacaccg gcaacaacaa catcggcttg 7740 

ttcaacaccg gcaacaacaa cgtcggcatc tggctgaccg gcgacggctt gtccggcttc 780 0 

agctccctga actccggcgc cggcaacacc ggtttcttca actccggcac cgccaacacc 7860 

ggcttgttca, jactccggcac cggcaacacc ggcttgttca actcgggcac cggcaacgtc- 792.0. 

ggcatcggca acatgggcac cggcggcttc ggcgtcggcc tatccggcga cagccaggtg 79 8 0 

ggcatcggcg gcaccaactc gggcagtttc aacatcggct tgtttaactc gggcaccggc 8040 

aatgtcggca tcggcaactc gggcaccggc aacgtcggca tcggcaacac cggcaccggc 8100 

aacaccggca tcggaaacag cggcaactac aacaccggct tgctcaacgc gggcctggtc 8160 

aacaccggca tcgccaaccc gggcaaccac aacaccggcc tgttcaacat cggcaccttc 822 0 

aacaccggca tcgccaaccc gggccactac aacaccggct cctacaacac cggtagctac 8280 

aacaccggca tggcaaacgc cggagactac ggcaccggcg cgttcatcac cggcagcatg 83 4 0 

aacaacggct tgctctggcg cgccgaccgg cagggcctgc tggcggccaa ctacaccatc 84 0 0 

accatcgagc gacctgccgc gttcctcaat gtcgacatcc cggtcaacat ccccatcacc 8460 

ggcgacatca ccaatgtctc catccccgcc attacgttcc ccagaatcga cgccagcgga 8520 

agcgtcgaca taggcatcct cagtggcacc gtcttggccc cggtcggtcc gatcaccctg 858 0 

catggcgggg acgcgtcggc cccgctggac acacccatcg aaattgactt cggcccctcg 8640 

ccggcgatca acctcaacat cggcaagccc gacggctcca ccgtgatcaa catcgtgggc 8700 

ggcgccggcg ccggcccgat cagcattccg atcatcgact tgcggccagc gcccggcttc 8760 

ttcaacgcca ccaccggccc gtcgtcgggc ttcctcaact, ggggtgctgg cagcgcatcg 882 0 

ggcttgctga acttcggcaa caactcgggc ctctacaact tcgccactag cagcatggga 888 0 

aattcgggct tccaaaacta tgggtcgctg cagtcgggct gggcgaattt gggcaacagc 8940 

atctcgggca tctacaacac cggcttggga gcaccggcaa atgtctcggg cttgctcaac 900 0 

atcggcacca acctggctgg gtggttgcag aacggcccga ccgagacgac cttcagcgtg 90 60 

ggcttggcca acctcgggtt ctggaatctg ggtagcgcaa acatcggcaa ctacaacctg 9120 

ggcagcgcca acatcggcgt ctacaacctg ggcagcgcca acatcggcga cttcaacctg 918 0 

ggcagcgcca acatcggcga cttcaacctg ggcagcgcca acatcggcag ctccaacatc 9240 

gggttcggca acgtcggtcc ggggctgacg gcggccatcg gcaacatcgg cttcggcaac 93 0 0 

accggaaacg gaaacatcgg catcggcaat accggcaccg gcaacatcgg cttcggcaac 93 60 

accggaaacg gaaacatcgg catcgggctg accggcgaca ccatgaccgg gttcggcggc 9420 

tggaactcgg gcaccggcaa catcgggcta ttcaactccg gcaccggcaa catcggcttc 948 0 

ggcaactccg gcaccggcaa ctggggcatc ggaaactccg gtgactacaa caccggcatc 9540 

ggcaacaccg gcagcaccaa ctccggcttc ttcaacaccg gcctggtcaa caccggcatt 960 0 

ggcaactccg gtgactacaa caccggccta ttcaacgccg ggaacaccaa caccggcagc 9660 

ttcaaccccg gcgactacaa caccggcggc ttcaaccccg gtaactacaa caccggctac 972 0 

ttcaacccgg gcaactccaa caccggcatc gccaactccg gcgatgtcaa caccggcgcc 978 0 

ttcaattcgg gcaactacag caacggcttc ttctggcggg gcgactacca gggcctaggc 9840 

ggtttcgcct accagagcgc cgtttccgaa atcccgtgga gctacgacag gttccaacat 990 0 

tga 9903 
<212> Type : DNA 
<211> Length : 9903 

SequenceName : SEQ ID 517 

SequenceDescription : 



Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

atgaacctgg tctccacaac gtcgggaatg tcgggcttcc tcaacgtcgg cgcgctggga 60 

tcgggtgtgg cgaatgtggg caacaccatc tcgggtatct acaacgtggg cacgtcggac 12 0 

ctctcgacgc ccgccgttaa ctccgggttg gcaaatatcg gaaccaatat tgccggcctg 18 0 

ctgcgcgacg gcgcgggtac tgcggctatt aacttgggct tggccaacca cggcaacctc 240 

aacgtgggct tcgcaagtct cggcggcttt aacttcggcg gcgccaccat cggccacaac 3 00 

aacgtcggca tcgggaacac cggaatcttc gatgtcggcc tggcgaacct gggcagctac 360 

aacatcggct tcggaaacct tggcgacgac aacctgggct tcggcaactt cggcagctac 42 0 

aacatcggct tcggcaacgt cggcaacgac aatctgggtt tcgctaacgc gggcggcggc 480 

aacatcggct ttgcgaacac cggcagcaac aatgtcggct ttgggaacac gggcagcaac 540 

aatgtcggca tcgggctcac gggcaacgga cagatcgggt tcggcagctt caactcgggc 600 
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agcggaaaca tcggcctgtt caactcgggc 
agcggcaact tcggcatcgc aaactcgggc 
aacaccaata ccggcctatt caactccggc 
agcttcaaca ccggtagctt caacaccggc 
5 aataccaaca ccggctacct caacattggc 
gacgttgaca ccggggcttt catcaccgga 
gattaccagg gcctggtcgg cctcaacctg 
ggcgtgaata ttcccatcga tatcccgatc 
ggcgtcacga ttccgcccac cggcgatatc 

10 cactttggcc ccattaccat tccgaacatc 
ataggagggc cgaataccgc gatcaccata 
atcagtatcc cggcggcgcc aggtttcgga 
ttcaataccg gcgccggcgg cgcctcgggc 
ttttggaacc t tggcctqcgc gacctcgggg 

15 ggatcaggtc tggcgaacgt gggcaccacc 
gacctcgcga cgccggcctt caattcaggc 
ctgctgcgcg acagcacggg caccatggtc 
ctcaacgtcg gcattgcaaa cctcggcgac 
gccaacttcg gcagcgccaa tatcggtggc 

20 ttcgacatcg gtttggcaaa tctgggcagt 
gacaacctgg gcttcggaaa cctcggcagc 
gacaacctgg gcttcgccaa caccggcagc 
aacaatatcg gcattgggct cacgggcgac 
ggcagcggaa acatcggctt gttcaactcg 

25 ggcaacggaa acgttggcat cggcaacacc 
ggcagcacca acaccggctt cttcaactcc 
ggcagcttca acaccggcag cttcaatccg 
ggcagctaca acacgggact cggaaacacc 
ggcagctaca gcaacgggtt cttgtggagt 

30 gcggcgctag cgattcccga aatcgcccta 
cccatcaaca tcgacgccgg ggtcgtcacc 
aataatatcg acttcacccc catcatcatc 
gcgatcaccg tgggcggacc caccacctcg 
tccatcacca tcccgatcat cgacattccc 

35 agtccgtcgt cgggcttctt caactccgga 
gtcgccggcg cctcagggat ttcgggttat 
actaacgtgg gtcacaccgt ctcgggtttc 
ccggcctttg cctccggtct catgcgcgac 
ctggcaaacc tgggcagcaa taacgccggc 

40 gtggcgaatc tgggcaacta caacatcggc 
tttgccaacc taggcagcta caacatcggc 
tttgccaaca ccggcagcaa caatatcggc 
atcggcgctc tgaactcggg cagcggaaac 
atcggcttct ttaactcggg caccgggaac 

45 ttcggcatcg gcaactcggg cagcaccagc 
accggcggct tcaaccccgg taacttcaac 
accggcggct tcaacgcggg taacaccaac 
accggcatcg cgaatacggg cgacgtcagc 
aacggcatct tgtggcgggg cgactaccag 

50 attccggaga ttccggcgca cttggacgtc 
agtttcaccg acctcgtggt ggacaatttc 
gcgtttagct ttcacatcca taccgagccg 
gtgctcagcg ttcccacgtt cgcgatcgcc 
agcgccaccg ccggcctcgg ccccatcacc 

55 ggcatcggaa actcgaccac cagcccgtcg 
gcatccgggt tcggcaacgt cggcggcaac 
gcctcaggag tctcgggctt gctcaacgtc 
ggcaacacca tctcgggtat ctacaacacg 
ggctccggcc tcgcaaacat cgccggcctg 

60 gacttggccg gcctcggcaa cctcaatgtc 
gggatcggga acaccggaat cttcaatgtc 
ggcttggcaa acctgggcaa ctacagcgtc 
ggcattgcta acaccggcag tgccaatatc 
ggcatcgggc tcaccggcac cggtcagatc 

65 aacatcggct tgttcaactc cggtgacgga 
aacgtgggca tcggaaacac cggcaccgca 
aacaccggcc tcgggaacac gggcagcacc 



agcaacaaca tcggattctt caattccggc 660 
agcttcaaca ctggcatcgg aaacaccggc 72 0 

gacgtcaaca cgggcgcctt caacccgggc 780 
agcttcaaca ccggtggctt caatccgggc 840 
aactacaaca ccggcatcgc caacaccggc 900 
aactacagca acgggttgtt cttaagcggc 9 60 

gtgatcgata tgcctctccc cataagcctc 1020 

accgcctcgg ccggcaacat cacccttatg 10 8 0 

gtcctttcgt caatagcggg ccagcgagcc 1140 

acggttgtcg gccccacgac gacagtcgcc 1200 

actggcggtg gcgccattag gatcccgctc 12 60 

aactcgacca ccaacccgtc gtcaggtttc 1320 

ttcggcaact tcggcggcgc caattcgggc 13 80 

gcgtcggggq tcctcaacgt .cggcgccctg -1440 

gtctcggggt tctacaacac cagcacgtcg 15 0 0 

ctggccaaca tcagcaccag tatcgccggc 15 60 

ctcaacctgg gcttggcaaa ccacggcacc 162 0 

tacaacatcg gctttgcaaa cctcggcagc 168 0 

aacaacatcg gcggcgcaaa caccggaata 1740 

tacaacatcg gcttcggaaa cttcggcgat 18 0 0 

tacaacgtcg gcttcggaaa cttgggcaac 186 0 

aacaatatcg ggttcgcgaa caccggcagc 192 0 

ggccagatcg ggttcggctc cctgaattct 1980 

ggcagcggaa acatcggctt tttcaactcg 2040 

ggcaccgcaa acttcgggct tggaaacacc 2100 

ggcgacgtca ataccggtat cggcaacacc 2160 

ggcgattcca acaccgggga tttcaaccca 222 0 

ggcgatgttg acaccggcgc cttcatctcc 22 8 0 

ggaaattatc agggcctcat tggcttgcac 2340 

acctttggcg tcgacatccc gatacatata 2400 

ctccagggct tcagcatcgt agctgccgaa 2460 

ccgaccatca atatcacctt gcccacggcg 2520 

atcggtatca ccgccagcgc cggtatcggc 2580 

gcgacatcgg gcttcggcaa ctcgaccact 2640 

gcgggcagcg cgtcgggctt tttgaacgtg 2700 

ctcaatgtcg gtgcgctggg gtcgggtgtg 2760 

tacaacgcga gcgcgttgga cctcgtgacg 2820 

ggtatgggca cgatgactct gaaccttggg 2880 

ttcggcaaca ccgggatctt tgacgtcggc 2940 

ttcggaaact tcggcgacga caacctgggc 3000 

gttgccaaca ccggcagcaa caatatcggc 30 60 

atcgggctca ccggtaccgg ccaaatcggg 3120 

atcggcttgt tcaactcggg cgacggaaac 3180 

ttcggcatcg gcaacaccgg caccggaaac 3240 

acgggcttgt tcaactcggg cgacggcaac 33 00 

accggcaatt tcaataccgg cagcttcaac 3360 

accggccact tcaacaccgg gaactacaac 342 0 

accggcgctt tcatctccgg caactacagc 3480 

ggcctgatcg gttactccta cgcgctgact 3 540 

aatatcccaa tcgacatacc gatcaccggc 3600 

actatcccca tcatcggctt cgaatccttc 3660 

gacatcggtc ccatcattgt cccgagcttc 3720 

gtgggcggac ccacgaccgc gatcaacatc 3780 

atcccgatca tcgacattcc ggcagcgccg 3 840 

tcaggcttct tcaacaccgg cgccggcacc 3900 

acatcgggcc tgtggaacct tgcgtcggca 3 960 

ggcgcgttgg gatcgggtgt ggcgaatgtg 4020 

agcccgctgg acctcgggac gccggccttc 4080 

ctgcagggcg gcgccggcac gacgatcctc 4140 

ggcttggcaa acctcggggg ctctaacttc 42 00 

ggtttcgcaa acgtgggcaa ccacaacatt 4260 

ggcttcgcca actcgggcaa ctaccatatc 4320 

ggcttcgcca acaccggtag cggcaatatc 43 8 0 

gggttcggca gcttcaactc gggcagccac 4440 

aacgtaggat tcttcaactc gggcaccggc 4500 

aacttcggca tcgcaaactc gggcagcttc 4560 

aacacgggcc tgttcaaccc gggcaacgtc 4620 
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aacaccggcg tcggcaacac cggcagcatc aacaccggca gcatcaacac cggcagcttc 468 0 

aacactggca gcaccaatac cggcagcttc aacctcggcg atcacaacac cggcagcttc 4740 

aactccggtg actacaacac gggctacttc aacgcgggtg actacaacac gggtgtggcc 48 00 

aacacgggca acgtcaacac cggcgcgttc atctccggca attacagcaa cgggttcttc 48 60 

5 tggcgaggtg actaccaggg gttgattggc ctttccacaa cgatcaccat tcccgaaatc 492 0 

ccctaccgct acgacttgag tgttccaatc gacataccca tcaccggcac cgtcgtcgcc 4980 

accacgccaa acagtttcac cattcccggt ttccagatac gagtcttgct tggtcctgcg 5040 

gcggtgcttg tcaacgagat gatcggcccc atcacgatcg atgtcaatca agtcatcgcc 5100 

atcgattcgc ccattcagca aaccatcagc atggttggca ccggcggctt cggcccgatc 5160 

10 cccatcggca tcagcatcgg tggtaccccg ggtttcggca actcgaccac cggcccgtcg 522 0 

tcgggtttct tccacaccgg cgccggccat gtatcgggct tcgggaactt cggcgccggc 52 80 

aacatgtcgg gctccgggaa cttcggcgct ggcaattcgg gcttctttaa cgccggcggc 53 40 

ttgggcaatt cgggcctact gaatttcggc gcgctgcagt cgggtctggc gaacctgggc 54 0 0 

aacaccatct cgggcgtcta caacacgagc acgctggacc ^fccigcgacycc cgccttcggc 545 Q 

15 tcgggcatcg caaacatcgg cgccaacctg gccggcctgt tcctcgacaa caccggcaac 552 0 

ctgacgctga acttcggcgt cgcaaaccag ggcggcctca acgcgggcat cgggaacctg 5580 

ggcagcgtca acatcggctt cgttaatacc ggcgactcca acctgggcat cggcaacctc 564 0 

ggcgacctca acttcggcgg ggtcaacatc ggcggtaaca acatcggcat cgccaacacc 5700 

gggatcttcg atatcggctt ggcgaacctg ggcagctaca acatcgggtt ggcaaatctg 5760 

20 ggcgacgaca acctgggctt tggcaacgcc ggcagctaca acatcggctt cgcgaacttc 5820 

ggcagcgaca acctgggctt tgccaacacc ggcagctaca acatcggctt cgcgaatacc 58 8 0 

ggtaacaaca acatcggcgt cgggctcacc ggcaacggcc agatcgggat cggcagcctc 5940 

aactcgggca gcaacaacat cgggctgttc aactccggca gcggaaacat cgggttcttc 60 0 0 

aactcgggca ccggcaacgt cggcatcttt aacaccggca ccggcaactt cggtctcgcg 6060 

25 aactcgggcg gcttcaacac cggcatcggc aacgcgggca gcaccaacac gggcgtgttc 612 0 

aaccccgggg acctcaacac cggcagcttc aacccgggca gcttcaacac cggcggcttc 618 0 

aacccgggca gtggcaacac gggctacctc aacaccggtg actacaacac gggcgtggcg 6240 

aacacgggcg atgtggacac cggtgcgttc attaccggca gctacagcaa cggcttcttg 63 00 

gtgagtggcg actatcaggg cctgatcggc ctgccgctgt tgggcattcc ggtgaccccc 63 60 

30 ggctacttca acctcactgg cggcccgtcg tcgggcttct tcaacagcgg cgccggaagc 642 0 

gtatcgggat tcgtgaactc cggtgccggc ctgtcgggct acctcaatac cggcgcgctg 6480 

ggatcgggtg tcgccaacgt gggcaacacc atctcgggct ggttgaacgc cagcgcgctg 6540 

gatctcgcga cgccggggtt cctttccggc atcggtaact ttggcaccaa cctggcgggt 660 0 

ttctttaggg gataa 6615 

35 <212> Type : DNA 

<211> Length : 6615 

SequenceName : SEQ XD 518 
SequenceDescription r 

40 Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgtcgttcg tgttgatcgc accggaattc gtgacagcag ccgcggggga tctgacgaat 60 

45 ctgggttcgt cgattagcgc ggccaacgcg tcggcagcca gtgcgaccac gcaggtgctg 12 0 

gctgcgggcg ccgatgaggt gtctgcccgt attgcggcgc tgttcggcgg gtttggcctg 180 

gagtaccagg cgattagtgc gcaggtggcg gcctaccacc agcggtttgt gcaggccttg 240 

agtaccggcg cgggcgcata tgcctcggcc gaggccgccg ccgctgagca gatcgtgctg 3 00 

ggcgtgatca atgcgcccac ccaggcgctg ctggggcgcc cgttgatcgg tgacggcgcc 3 60 

50 aatgcgacga ctcccggcgg ggccggcggg gccggcggtc tgctgttcgg caacggcggg 420 

gccggggcag ccggggcgcc cggccaggcc ggcgggcctg gcgggcccgc cggattgtgg 480 

ggcaacggcg ggcccggcgg ggccggcggc agcggtgggg gcaccggcgg tgccggcggc 540 

gccggtgggt ggctgttcgg ggttggcggc gccggcggtg tcggtggggc cggtggcggc 600 

accggcgggg cgggcgggcc cggtggtttg atctggggcg gcggcggggc cggcggtgtc 660 

55 ggtggggccg gtggcggcac cggcggggcc ggcggccgcg ccgagctgct gttcggcgcc 720 

ggcggtgcgg gtggggcggg caccgacggc gggcccggtg ctaccggcgg gaccggcgga 780 

cacggcggag tcggcggcga cggcggatgg ctggcacccg gcggggccgg cggggccggc 840 

gggcaaggcg gggcaggtgg tgccggcagc gatggtggcg cgttgggtgg taccggcggg 900 

acgggcggta ccggcggcgc cggtggcgcc ggcggtcgcg gcgcactgct gctgggcgct 960 

60 ggcggacagg gcggcctcgg cggcgccggc ggacaaggcg gcaccggcgg ggccggcgga 1020 

gatggcgttc tggggggtgt cggtggcact ggtggtaagg gcggtgtcgg cggcgtggct 1080 

ggcctcggcg gggccggtgg tgccgcgggc cagctcttca gcgccggagg cgcggcgggt 1140 

gccgttgggg ttggcggcac cggcggccag ggtggggctg gcggtgccgg agcggccggc 1200 

gccgacgccc ccgccagcac aggtctaacc ggtggtaccg ggttcgctgg cggggccggc 1260 

65 ggcgtcggcg gccagggcgg caacgccatt gccggcggca tcaacggctc cggtggtgcc 13 20 

ggcggcaccg gcggccaagg cggcgccggc ggcatgggtg gctccggtgc tgataatgcc 13 80 

agcgggattg gcgccgacgg cggcgcgggt gggactggcg gtaacgccgg cgccggcggg 1440 
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gccggcgggg ccgccggcac cggaggaacc 
ggtatcggcg gcaccggcgg ccaaggcggc 
gcgaccgcta ccggtgccac cggcggcacc 
ggggccggcg gcaacaccgg ggttggcggc 
5 ggcggcgcgg gcggcgccgg tggtgctggc 
ggcggcaccg gcggcaccgg cgggaaaggc 
agcagcggtg ccggcggcac caacggctct 
ggcgccgggg gcgctggcgg ggccggcgcc 
ggcaccggcg gcaccggcgg agcggccgga 

10 ggcggcaccg gcggcgcggt gggcagcgtc 
acgggtggtg tcggtggtgc tggtggtgca 
accggtggcg ccgggttcgc cggcggcgcc 
ggtgtgggcg gcaccaacgg ctccggcggc 
-ggaggtgccg, gcgggtccgg cgcggacaac 

15 ggcggcacag gtggcgcggc cggcgccggc 
accggcggcg ttgtcggcgc caccggtagt 
ggtgacggcg gcgatggggc cagcggtctc 
caaggcggcc aaggcggggc cggcggcagc 
ggggccggcg gcaacggcgg cgacggcggg 

20 gacaacggcg gggtcggcgg tgacggtggg 
gcgggcgtcg gcctgacagc caaggccggc 
gggggcgccg gcggtgctgg cggggccggc 
gccggcggcc aaggcggcca aggcggcctg 
aacggcggcg ccggcggcaa cggcggcacc 

25 accctgggcg tcggcggctc cggcggcacc 
ggtggcggcg gcttcggcgg ggccgcgggt 
ggcggtgacg gcggcgatgg ggccagcggt 
ggccaaggcg gccaaggcgg ggccggcggc 
ggcggggccg gcggcaacgg cggcgacggc 

30 ggcgacaacg gcggggtcgg cggtgacggt 
aacgcgggcg tcggcctgac agccaaggcc 
aacgggggcg ccggcggtgc tggcggggcc 
ggtgccggcg gccaaggcgg ccaaggcggc 
gccaacggcg gcgccggcgg caacggcggc 

35 ggaaccctgg gcgtcggcgg ctccggcggc 
ggtggtggcg gcggcttcgg cggggccgcg 
gttggcggtg acggcggcga gggagccagc 
ggcggccaag gcggccaagg cggggccggc 
gccggcgggg ccggcggcac cggcggggcc 

40 ggcggacccg acggcggtga cggcggccaa 
ggattcggcg ccggtgttcc cggcgacggc 
ggtgttcccg gcgacggcgg gatcggcggc 
ggcgccgacg gggaccccag cattgacggc 
caaggcggca aaggcggcct gaacagcacc 

45 ggcaacggcg gggccggcgg ggccggcggc 
gggtccggcg gcaccggcgg gaccggcggc 
ggcggaaccg cgggcaacgc cggtatcggc 
gccggcgaca gcggcgccct ctcccaagac 
caaggcgggg tcggcggcaa cgccggcgcc 

50 ggcaccggcg gggccggtgg tgacggccag 
ggcgccggcg gccaaggcgg tgacggcggc 
gccggattcg gcgccggtgt tcccggcgac 
ggcgccggcg gcgccggcgc cgacggggac 
ggcggccacg gcggccaagg cggcaaaggc 

55 gccagcggtg acggcggcaa cggcggggcc 
gacggcttta tcggcgggtc cggcggcacc 
ggcctggcca acaccggcgg aaccgcgggc 
ggcggcgacg gcggggccgg cgacagcggc 
ggcggccaag gcggccaagg cggggtcggc 

60 gccggcggca ccggcggcac cggcggggcc 
gtggcgagcg agggcggcgc cggcggccaa 
ggggccggcg gcaacgccgg attcggcgcc 
accggcgggg ccgggggcgc cggcggcgcc 
ggccaaggtg gtgccggcgg ccacggcggc 

65 gggctagcca gcgccgccag cggtgacggc 
aacggcggag ccggcgggct cggcgggggc 
ggcctcggcg gaggcggcgg caacggcgga 



ggcggggttg tcggcgccgc gggcaaggcc 150 0 

gccggcggcg cgggcagcgc cggcacggat 1560 

gggttttccg gtggagccgg cggggccggc 1620 

accaacggct ccggcgggca aggcggcacc 1680 

ggtgtcggcg ccgacaaccc caccggcafcc 1740 

ggcgccggcg gggccggcgg gcagggcggt 1800 

ggtggcgctg gcggcaccgg cggacaaggc 1860 

gataacccca ccggcatcgg cggcgccggc 192 0 

gccggcgggg ccggtggcgc catcggtacc 19 80 

ggtaacgccg ggatcggcgg taccggcggt 2040 

ggtgcggctg cggccgctgg cagcagcgct 2100 

ggcggagaag gcggagcggg cggcaacagc 2160 

gccggcggtg caggcggcaa gggcggcacc 222 0 

cccaccggtg ctggtttcgc cggtggcgcc . 22,8 0 

ggggccggcg gggcgaccgg taccggcggc 2340 

gcaggcatcg gcggggccgg cggccgcggc 2400 

ggcctgggcc tctccggctt tgacggcggc 2460 

gccggcgccg gcggcatcaa cggggccggc 252 0 

gacggcgcaa ccggtgccgc aggtctcggc 2 580 

gccggtggcg ccgccggcaa cggcggcaac 2640 

gacggcggcg ccgcgggcaa tggcggcaac 2700 

gacaacaatt tcaacggcgg ccagggtggt 27 60 

ggcggggcaa gcaccacctc gatcaacgcc 2820 

ggcggcaaag gcggcgccgg tggtgcggga 2880 

ggcggggacg gcggcgatgc gggctctggt 294 0 

aaggccggcg gcggcggaaa cggcggccgc 3 000 

ctcggcctgg gcctctccgg ctttgacggc 3 060 

agcgccggcg ccggcggcat caacggggcc 3120 

ggggccggcg caaccggtgc cgcaggtctc 3180 

ggggccggtg gcgccgccgg caacggcggc 3240 

ggcgacggcg gcgccgcggg caatggcggc 33 00 

ggcgacaaca atttcaacgg cggccagggt 33 60 

ctgggcgggg caagcaccac ctcgatcaac 3420 

accggcggca aaggcggcgc cggtggtgcg 3480 

accggcgggg acggcggcga tgcgggctct 3540 

ggtaaggccg gcggcggcgg aaacggcggt 3600 

ggtctcggcc tgggcctctc cggctttgac 3660 

ggcagcgccg gcgccggcgg catcaacggg 3720 

ggtggtgacg gcgccccggc gaccctgatc 3780 

ggcggcatcg gcggggacgg cggcaacgcc 3840 

ggggacggcg gcaacgccgg attcggcgcc 390 0 

accggcgggg ccgggggcgc cggcggcgcc 3960 

ggccaaggtg gtgccggcgg ccacggcggc 4 020 

gggctagcca gcgccgccag cggtgacggc 4080 

aacggcggcg acggcgacgg ctttatcggc 4140 

gacgccggcg tcggcggcct ggccaacacc 42 00 

ggggccggcg gccgcggcgg cgacggcggg 42 60 

ggcaacggct tcgccggcgg ccaaggcggc 43 20 
ggcggcatca acggggccgg cggcaccggc 43 80 
aacggaacga caggcgtggc gagcgagggc 4440 
caaggcggca tcggcggggc cggcggcaac 4500 
ggcgggatcg gcggcaccgg cggggccggg 4560 
cccagcattg acggcggcca aggtggtgcc 4620 
ggcctgaaca gcaccgggct agccagcgcc 4680 
ggcggggccg gcggcaacgg cggcgacggc 4740 
ggcgggaccg gcggcgacgc cggcgtcggc 48 00 
aacgccggta tcggcggggc cggcggccgc 4860 
gccctctccc aagacggcaa cggcttcgcc 4920 
ggcaacgccg gcgccggcgg catcaacggg 4980 
ggtggtgacg gccagaacgg aacgacaggc 5040 
ggcggtgacg gcggccaagg cggcatcggc 510 0 
ggtgttcccg gcgacggcgg gatcggcggc 5160 
ggcgccgacg gggaccccag cattgacggc 5220 
caaggcggca aaggcggcct gaacagcacc 5280 
ggcaacggcg gggccggcgg ggccggcggc 5340 
ggtggcacag gcggcaccaa cggcaacggc 5400 
gccggcggtg ccgggggaac gcccaccggc 5460 
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agtggcaccg aggggaccgg cggcgacggt ggagatgccg gcgccggcgg caacggcggc 
tctgccaccg gcgtcggtaa cggcggtaac ggcggtgatg gcggcaacgg cggcgacggc 
ggcaacggcg cacccggcgg cttcggtggc ggcgctggcg ccggcggctt gggcggctcc 
ggcgccggcg gcggcaccga cggcgacgac ggcaacggcg gcagccccgg caccgacggc 
agctaa 

<212> Type : DNA 
<211> Length : 5706 

SequenceName : SEQ ID 519 
SequenceDe script ion : 

Sequence 



5520 
5580 
5640 
5700 
5706 



10 

Sequence 

<213> OrganismMame : Mycobacterium tuberculosis H37Rv 
<40Q>„ PregequenceString : 

15 atgtcgttgg tgatcgtggc cccggagacg gtggcggccg cagccttaga 
atcgggtcat cgatcggcgc ggccaatgcg gcggcggcgg ggtcgaccac 
gccgcgggcg ccgatgaggt gtcggcggcg atcgcgacgc tgtttggcag 
gagtatcagg cgatcagcac gcaggtggcg gcgtttcatg accgatttgc 
agcgccgcgg tcggctcgta tgtcagcgcg gaggcgacca acgccgcacc 

20 ctggagcaca acgtgctcaa tgccctcaat gcgcccaccc aggcgttgcfc 
ttgatcggtg atggggcggc tggagcaccc ggcaccgggc aggccggcgg 
atcttgtggg gcaacggtgg ggccggcggc tcgggcgcgc ccggccaagt 
ggcggggccg ccgggttgtt cggcaccggc ggggccggtg gggccggtgg 
gccggtgggg ccgggggtag cggcggctgg ctgctgggca atggtggagt 

25 ggcgggcaga gcctgctggg cggggcaacc ggcggggccg gcggcaacgc 
ggggtcggcg gaaccggcgg gcccggcggg cccggcgggc ccggcggggt 
ggtggtgccg gtggcctggg cgggaccctc tacggggccg gcgggcacgg 
gggcccggcc cgatcggtgg tgtgggtgga cacggcggtg tcgggggtgc 
ttgggcgtgg gcgggcacgg tggtgccggc ggacacggcg cggagggtgt 

30 gccggtgagg acttgtcccc gcacggtacg tccggtgggg tcggcggcga 
ggcggcaccg gagggcgggg cggctggctg gccggcgccg gtggggccgg 
gsggttggcg ggaccggcgg ggccggcggg gccggatttt ctcgtgcctt 
ggggataacg gcggtgaccc cggcgccggc ggggccggag gcaccggcgg 
acgatcggtg cccacggcgc ggccggggcc agccccacca gcggcggcaa 

35 ggcggcaacg gcgcccactt ctcatcgggc ggcaaagccg gcggtaacgg 
ggggccggcg ggctggtcgg caatggcggc gccggaggtg ccggcggcaa 
ggtgccccgc cttctggcgg cgaccctaac ggcggtggcg gcggagctgg 
ggtaagggcg gcgacggcgg agcccaagca ggcgacggcg gtgctggcgg 
aagggcggca acggcggcaa cggcgccacc ggcgccaccg gccttaacgg 

40 ggcgcggatg gcaccgacgg cggcaagggc ggcaacggtg gagccggcgg 
gccggtgggc aaggcggtaa ggcgcttgca gccacccacc aggacggcag 
ggcggggccg gcggaaacgg cggcgccggc ggcatgggcg gtgatggcgg 
aagggcacgt tcgataacgg cggcgatgga gttggtggca acggcggcaa 
cgcggcatcg gtggtgctgg tgggatcggc ggcgccggat ccaccgcggg 

45 gcccgcggcg ccaccccgac cagcggcggc aacggcggca ccggcggcaa 
gccaccgtcg ccggcggggc cggcggggcc ggcggcaagg gcggcaacgg 
ggtaatggcg gggccggcgg caaaggcggg gacggcatgg ccggtgtcgc 
cccaccaccg cgggcgaatc cggcacgagc ggccagaacg gcggggctgg 
ggggccggcg gccggggcgg agacttcggg ggcgacggcg ggaccggtgg 

50 aacggcgcca acggcgccaa cgccaccacg ccgggcgcca agggcggcga 
ggcgggcccg gcgcgcaagg cggtaacggc ggccaaggcg ggcccggcgg 
aacctctttg gccagaacgg aatccagggt gtcggcgggt ccggcggcaa 
ggcgggctcg ccggtgacgg cggcaacggc gccaacggca acttcgcctfc 
aacggcggtc acggcggtaa cggcggtaac cccggcgccg gcgggcaggg 

55 ggcgccggct ctaccccagg cgccaagggc gcccacggct tcactccaac 
gacggcggcg acggcggcaa cggcggcaac tcccaagtgg tcggcggcaa 
ggcggcaatg gcggcaacgg cggcagcgcc ggcacgggcg gcaacggcgg 
gacggcgcgt ttggtggcat gagtgccaac gccaccaacc ctggtgaaaa 
ggtaaccccg gcggcaacgg tggcgccggc ggggccggcg gggccggcct 

60 aacggtggcg ccggcggcaa cggcggcctc ggcggattcg gcggcaacgg 
gccaacggcg tggccgtggg tgcgcccgga caacccggcg gtgccggcgg 
gccggcggca acggcggggc cggcggcaac ggcggtcaag gcgtagtcag 
ggcggtgccg gtggggccgg cggcgacggc ggtgctcccg gtgacggggc 
aacggccagg gagcaggggc cttcgccggc ggcggcggcg ggcgaggcgg 

65 aacgccggca acgccggtgc cggcggcccc ggcggcaccg gctccaccgc 
gggccggccg gcagcatcct gcacgacggc ggcaacggcg gtcacggcgg 
gccagcggcg gaaacggcgg ccccggcggc cacgggggta acggcggtaa 



tgtggcgcgc 
cagcgtgctg 
ccatgctcgg 
gcagacgtta 
gttggcgacg 
gggtcgcccg 
ggccggcggg 
cggcggggcc 
ggccggcgcc 

c gg c gggg cc 

cggactgttc 
cggcggtacc 
a ggtgccggc 
ggccgggctg 
ggccggcgca 
cgccggcgat 
cggggccggt 
gattgtcgct 
agccggctcc 
cggcggggcc 
cggggccggc 
cggcgccccc 
cggcgctggc 
cgccggcggt 
cctgggggca 
tggcggtgga 
catgggtgcc 
caacggtgcc 
cggcggtagc 
tgcagacggt 
cggcgcgaac 
cgggcttgtt 
cggttcctcg 
cggggcgggc 
ggccggcggc 
cggcggccac 
tttggcgggg 

a gggggggcc 

cggcgatggt 
cggtagcggt 
cagcggcggc 
cggcggcgac 
ccgcggcggc 
cgggccaaac 
gaacggcggt 
cgccgccggt 
gcacggcggc 
cgacggcgcg 
caacggcggc 
cgacggcggc 
aggcaaggcg 
ccacggcgct 
cggcggcacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
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ggcgccaacg gcggcaacgg tggcatcggc gggactggcg gcgccggcag caccggcgcc 3240 

aagggcgtcc tcggcaccaa cgagggcgat ggcggtgacg gcggcagagg cggcaacggc 3300 

ggcagaggcg gcaacggcgg ccaaggcctc accggagccg gcggcaacgg tgggaccggc 33 60 

gggacacccg gcaacggcgg caacggtggc aacggcgcca gtggcgacct tgtcacctca 3420 

5 cctggtgacg gcggcggcgg gggccgcgga ggcgatgcag gacgcggagg cgatgcagga 3480 

cttggcggtt ccagcggccc aggtggcacc cccggcgact ggggcaccgg cggcaccggc 3540 

ggcaccggcg gcaccggcgg ccagggtgcc aacggcggcc tcaccggcgg cagaggcggc 3 600 

actggcggca acggcggcaa cggtaacacc ggcggcaccg gcggcgctgg cggcaccggc 3660 

ggcaccggcc acaacggctc tcagcctggg atgggtggca acggaggtgc gggcggcttt 3 720 

10 ggcggcaatg gttttgccgg cgtcggtggc cggggcggga tgggtggctc cgggggcacc 3 78 0 

ggcggcaccg gcgacgcggg gccctttgga acaggcaccg gtgggaccgg tggccacggg 3 840 

gggcagggcg gtggcggcgg cttcagcatc ctccttggtc tcggcggtct cggcggcctc 390 0 

ggtagcccgg ggtccatcgc cacaggtacc gccggcggcg ccggtggcgg cggtggcttc 3960 

ggcgggctcg .gcggcggcga attcgtgtag ,-*-•• . 3990 

15 <212> Type : DNA 

<211> Length : 3990 

SeguenceNatne : SEQ ID 520 
SequenceDescription : 



20 Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> Pre Sequences t ring : 

atgtcatatg tgatcgcgac gccggagatg atggcaactg cggcttttga tctggcgcgt 60 

25 attggttcgc aggtgagcgc ggctagtgcg gtcgcggcga tgccgacgac ggaagtggtg 120 

gccgccggcg ccgacgaggt gtcggcgggc atcgcggcgt tgttcagcgc gcacgctcag 18 0 

gagtatcagg cgctaagtgc gcaggccgcg gcgtttcacg accagttcgt gcacacgctg 240 

accgcggccg cgaggtggta cacggccacc gagatcgcca acgccgcggc gatgcgagtc 3 00 

gtgctcggag ccgtgaatgc gcccacccag acgctactgg gacgcccgct gatcggcgat 3 60 

30 ggtgcgcacg ggacagcgcc tgggcagccc ggtggggccg gtggattgtt gttcggcaat 420 

ggcggcaacg gcgctgccgg tgccgtcggg caggtgggcg gcgccggcgg ggcggccggg 480 

ttatttggga ccggcggcgc cggtggcgcg ggcggggcgg gcgcacccgg aggtaccggc 540 

gggacgggtg gatggctggc gggtgggggc ggcgtcggcg gtatgggtgg ggctggtggt 600 

ggcgccggcg gggcgggtgg caacgcgggc ctgttcggca acggcggcgc cggtggtgcc 660 

35 ggtggggctg gtggtggcgc cggcggcgcg ggcggtaacg cggggtggtt tggtcatggg 72 0 

ggcgctggcg gcgtgggtgg tgtaggtgcg gccggggcca acggtgctac gcccggtcag 780 

gatggggcgg ctggtgttgc cgggtcggac gacggtgccg gcggtgacgg tttggcaggg 840 

tcggacgggg gcgatggcgg tgccggtggg gtgggcggca acggtggtcg ^ggagggtgg 900 

cttctcggta acggcggcgc cggtggcgtt ggcggfcgtcg gcggggccgg tggtgccggt 960 

40 gcggccggcg gtgccggcgg tgccggggct accggaataa atgggccggc cggtafcctcg 1020 

gcggccggcg gtgacggcgg cgccggcggc aatggcggtg ccggcggaaa cggcggcgtg 1080 

ggcggcgccg ggggtgctgg cgggtcggct ggactattgg ggtatgtcgg ccgggccgga 1140 

gacggcgggg ctggcggggg tggggggctg ggtggagcgc ctggtgacgg cggtgccggc 1200 

ggcaacggtg gcagttggct ggccgccggt gacggcgggg ccggcggtca cggtggcgac 12 60 

45 ccgggcctgg gtggggctgg cggagccggg ggggcgtcgg gcggcgccgg tgctcgcgcg 1320 

ggggccaatg gtctggcggc cggcaacgac gggccggtca gcggcggcaa cggcggcaaa 13 80 

ggtggcaatg gcgcccacgc accggtcgcc ggcggtcatg gcggtaacgg cggtgccggt 1440 

ggcaacggcg ggttggtcgg tgacggtggg gccggcggtc atggcggtga cggagccgcg 1500 

ggtgccggct atgccgatat gacggcgatc ttcctgggtt catccggtac ccccggtgag 1560 

50 gatggaggta acggcggggc cggtggggcc ggcggcgcgg gtggggccca cgccggcgat 1620 

ggcggggccg gcggtgccgg cggcaacggt ggggccggcg gggccggtgg taacggagct 168 0 

cacggtttca atgctgtgct cgtatctgac ggcggcaacg gtggtgatgg cggagccggc 1740 

ggtcgcggcg gtgacggtgg ggccggtggg gctggcggtg acgcacctgc gggtcgggcg 1800 

ggcagccagg gtgttggcgg ggacggtggt gccggcgggg ctggtggggc gcccggtaac 1860 

55 gggggcagcg gtggccgcgg cgacatggct ttcaaggatg gtgacggtgg ggccggcggc 1920 

gatggtggtg acccaggcgc cggtgggaag ggtggcgccg gcggcgccgg cgccaccgag 198 0 

ggtgtgaccg gcgcgaccgg cgctaccgtg cacagtggtg gcaacggcgg caagggcggt 2040 

aacggcgccg acgccaccgt ggctggcgcg aacggcggca agggcggtgc cggtggtaac 210 0 

ggcgggttgg tcggtgacgg cggggccggc ggcgacggtg gtagcggcgc ggccggtgcg 2160 

60 aatggcgcga acgtaggtga ggacggcgcc gacggcaccc tctcggggca accgggcgaa 2220 

ggaagcgagg ccaacggcgg tcaaggcggc gttggcggtg gcggcgcggg cggtgccggc 2280 

ggtgacggtg gtgccgggag ttcggcgtta ggtagtggcg gcaacggcgg tcgaggcgat 2340 

gcggggcagg ccggcggcgc gggcggtgcc ggcggagccg gtggcgccgg tgggtcggtg 2400 

tcgggcgacg gtggacccgg tggcaagggc ggggccggcg gtgcgggtgg tgccggagct 2460 

65 agtggtggcg gcggcggcaa gggcgcctcc ggcgccgaca gcgctgaggc cgttggaggc 2520 

gccggcggga aaggcggcga cggcggtgtc ggcggcgttg gcggtgacgg cgggcccggc 2580 

ggtgatggcg gcgccggcgg ggccgcaccc gcgggtcagg tcggcagcca cggtgtcggc 2640 
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ggcgttggcg gtgacggcgg gctcggcgga gccggcggca acggcggtga cggcggtcat 2700 

ggcagtgatg gcggggacgg cggtgacggc ggtgaccccg gcgcgggagg cctcggcggc 2760 

ttgggcggcg acagcggcaa cggcacccgc gcggccagcg gtgtggacgc cagcgaccac 2820 

gggcccggca .gcggcggcaa cggtggtaac ggtggcaacg gtgcgcaagc cagcgtcgcg 2 880 

5 ggcggcgccg ggggcaacgg aggcgacggc ggcaatgccg gccgggtcgg cgatggtggc 2940 

gccggcggca atggtggcga tggcgcggcc ggcgccaacg gggccaattc gggcgcgcct 3 0 00 

ggttcagatg ccctcgccct cggccaaccg ggaggcaacg ggggtcaggg cgacgcgggg 3 060 

caggccggcg gtgccggtgg tgccggtggt gccggtggcg ccggtgggtc ggtgtcgggt 3120 

gacggcggtg ccggcggcaa cggcggggcc ggcggcaacg gcggtgtggg cgctagcggt 3180 

10 ggggccgggg ccaggggcgc caacggtatc gacagtatcg gcggcaccgg tggggccggc 3240 

ggtggtggcg gcgatggcgg tgccggcggg gtcggtggac atggcgggga cggtggggtc 3 3 00 

ggcggggccg caccttcggg gacggtcggc agtcacggca ccggtggggt cggtggcgac 3 3 60 

ggcggactcg gcggcgccgg cggcgttggc ggggccggcg gcaacggcgg cattggtatc 3420 

accgtcgqcg gcgccggcgg 3-g n Z3 c !-'Z ! 3*~ -^ggcggtg *~ ciggcggtcgc 3480 

15 ggcggtctgg ggggtgacag cggcaacggc acctccgcgg ccaacggcgt ggacgccagc 3540 

aaacacgggc cactgaccgg cggcgacggc ggcgtcggcg gcaacggtgc caaggccgcc 3 60 0 

gcggccggcg gcgacggcgg ccagggcggt gacggcggca acgccgggct attcggggac 3660 

ggcggagccg gtggtgatgg ggccgacggc accgctgccg aagctctcgg cggtgatggc 3 72 0 

ggggccggtg gggctggagg caagggcggt gacgccggcg acatcggtga cggcggtgac 378 0 

20 ggcggcaagg gtggcgacgg cgcgcacggt gccctcggag ggctcaccgt tgctggcggc 3 84 0 

aacggtgggg ctggtggcgc cggtggggct ggtggcgccg gcggagcatt tctgggcgac 390 0 

ggcgggaacg gcggagccgg cggccaaggt ggggctggcc ggggcggcag ccccggcggc 3 960 

ggcggcgggg ttggtggaca cggcggggcg ggcggcgacg ccgggatgaa cggcggcggc 402 0 

ggaaccggcg gccagggagg caacggcgcg gccggtggcg cgggttggtc gcccgactcc 408 0 

25 gacctaaagg gcttcgacgg cttcgacggc ggcagcggtg gggcgggagg cgacggcggg 4140 

gccggtggcg ccggcggaac tcagaccggc gacggtggtg acggcggggc cggaggccta 42 00 

ggcggggctg gtggggtcgg cggtaacggc gttgacggct ttgacattaa cgaaacgacg 4260 

ggccgcgacg gcggcgacgg cggcgacggt ggctacggcg ggtggggcgg cgccggcgga 43 2 0 

aacggcgggg ccggcgggtc ggcacccgcc ggcgaggtcg gcaatcgagg cgttggcggg 43 80 

30 gacggcggtg acggcgggtc gggcggggac gctggtaatg gtggcttggg cggtgacggc 4440 

tttacctatc tcgcggattt tgacggagag cctggcggcg acggaggtga cggcggcgac 4500 

ggcgggtggg gccgccccgg cggacaaggc ggtttcggtt ccacaagtgg cgcgcacggc 4560 

aaggccggct tcggcgcccc cggcggtgac ggaggcgacg gcgggaacgg cggtcacggt 4620 

ggggacggca acggcagctt tgctgatgca ggcgacggcg ggccgggtgg caacggcggc 4680 

35 aacggcggct tgggtggggc cgggcgagac ggtggcgccc ccggtggtga cggaggcgac 4740 

ggcgggaccg gcggctccgg cggcttcggc gcccctccgc ctcgcagcat cggcggcggt 48 00 

gacggcgggg acggcgggcg cggaggtgac ggcggacgcg gtgccggtgg cttgacctcc 4860 

ggcggcgtcg gctcgtcggg cgagtcgggt ggctccggca acgggagggg cgaccctggc 4920 

tccggcggtt ccggcggtga aggcggcgaa ggcggcccct ccatctccgt taacgtcacg 498 0 

40 tag 4983 
<212> Type : DNA 
<211> Length : 4983 

SequenceName : SEQ ID 521 
SequenceDescription : 



45 



Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400>' PreSequenceString : 

50 atgtcgtttg tgttggtttc gccggagacc gtggcggcgg tggccacgga tctcaagcgc 60 

atcggcgcct cgctggccca cgaaaacgcg tcggcggccg cttcgacgac ggcggtggtc 120 

tccgcggccg ccgacgaggt atcgacggcg gtcgccgctc tgttctccca acacgcccag 180 

ggctaccaag cggcggccgc tcaggtagca gcgtttcata gccggtttgt gcaagccctg 240 

acggccggtg ccggggcgta cgcatttgcc gaggcggcca acgcgtcgcc gctacagtca 300 

55 gccatgggtg cggtaagcgc gtctgcgcag acgctgttgt cgcgcccgtt gatcggcaat 3 60 

ggcgccaatg cgacgacgcc gggcggtaac ggcggcgacg gcggatggct attcggcagc 42 0 

ggcggcaacg gcgcgcccgg cgcggcgggc cagtccggcg gtaacggcgg gtcagccgga 480 

ctgtggggta acggcggcgc gggtggcgcc ggcggcagcg gcggcgccgc cggcggcaac 540 

ggcggtaacg gcgggtggct gttcggcgcc ggcggcaccg gcggtatcgg cggcaccggt 600 

60 gctcccggcg ccatgggcgg caccggcggc aacggcggca acggcgcgct gctgatcggc 660 

ggcggcggcc tcggcggcgc cggcggcatg ggtggcaccg gcggcggcac cggcggcacc 72 0 

ggcggcaacg gcggcaacgg cgcgctgctg atcggcgctg gtggtgtcgg aggtgctggc 780 

gggatcggtg gccagggtac cggcgccggc ggtgccgccg gcgccggcgg caccgggggc 84 0 

aacggcggcg ccggggggtt gttcatgaac ggcggcgacg gcggcgccgg cggtcaaggc 90 0 

65 ggcgacggtg cggccggcga cgcggctgcc agcgccggcg gcaccggcgg caaaggcggc 960 

caaggcggcg acggcggcac cggaggggcc ggcggcgcag gcccagtgct gttcggccac 102 0 

ggcggcgccg gcggcatggg cggccaaggc ggcaccggtg gaatgggcgg cgccggcgga 108 0 
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gacggcacca ccgtcatcgc ggccggtacc gggggggagg gcggcaccgg cggcgcggcc 1140 

ggcgccggcg gagccgcagg cgctcgcggg gctctcacca gcggcggcct agccggcggc 120 0 

gtcgggg'ccg gcggcaccgg cggcaccggc ggtaccggcg gcaacggcgc tgacgccgct 12 60 

gctgtggtgg gcttcggcgc gaacggcgac cctggcttcg ctggcggcaa aggcggtaac 132 0 

5 ggcggaatag gtggggccgc ggtgacaggc ggggtcgccg gcgacggcgg caccggcggc 13 8 0 

aaaggtggca ccggcggtgc cggcggcgcc ggcaacgacg ccggcagcac cggcaatccc 1440 

ggcggtaagg gcggcgacgg cgggatcggc ggtgccggcg gggccggcgg cgcggccggc 150 0 

accggcaacg gcggccatgc cggcaacaca ggtgacggcg gcgacggcgg gaccggcggt 1560 

aacggcggca acggcaccgg aggcgtgaac ggcgccgaca acaccctcaa ccccgacacc 162 0 

10 cccggcggcg ccggggagcc cggcggggcc ggcggggccg gcggggccgg cggggccgcc 1680 

ggcggcccgg gcggtaccgg cggtaccggc ggtaacggcg gcaacggcgg caacggcggc 1740 

aacggcggca acggcggcaa cggcggcaac ggcggcaatg ccggcaacaa cagcaccaat 18 00 

gccccagtcg gtggcgaagg cggcgccggc ggcgacggcg gcgccggcgg cgcaggcggg 18 60 

gccgccaacg gcggcaccgc gggcagccag ggcactgggg gcykaggagg cgacggcggc 1920 

15 gcgggcggca acggcggcgg cggcaaggct ggcaccggca acagcggcaa ctttggggtg 1980 

gacggcgaag ccggcttcag cggcggcgcc ggtggcaacg gcggcgtagg cggggccgcc 2040 

ggcgccaatg gcggaaccgg cggcagcggt ggtaatggcg gtgacggcgg tgcgggaggc 210 0 

attggcgggg ccggcggcaa cggcataccg ggcactggca cagagcctgc cgggggcacc 2160 

ggcgccaaag gtggagacgg cggcgacggt ggcgccggcg gcgcaggcgg caatgccggc 2220 

20 ggggccggcg gccagggcgg caatgccggc cagggtggcg ccggcggtgc gggcggcaac 22 80 

gccgtgattc ccggcgacgg cgtcgggaag gcgccgcacg gcgacgcggg cggcagcggc 2340 

ggagacggcg gcaaaggcgg ccagggcggt agtggcggca ccggcggatc cggtgccccg 24 00 

atcggtggcg gcgccggagg caccggaggg tccggcggac acgccggcaa gggtggcgcc 24 60 

ggcggcatcg gcgcacaggg caccaccatc accgtgcccg ggaacggcgg caacgccggc 252 0 

25 gacggcggca acggcggcaa cgccggcgcc ggtggaaacg gcggctccgg cgacttcggt 2580 

ggcaatacca ccagcggcgc ctccggcagc ggcggcaacg gcggcaacgc cggcaccgcg 2640 

ggtagcggcg gtgcgggcgg aaccggcggc accggcctta gcggcggcaa cggtggcaac 2700 
ggcggcaacg gcggcaacgg cggtgacggc ggtaacggcg cc cacggcac cgtcggcgcc * 2760 

cagttcgtcc cggccaccag cttgcccaca cccaacggcg gggccggtgg caacggtggc 2820 

30 accggaagca acggcggcgc gcccggcccc gccggggcgc ccggccccac taccggcggt 288 0 

aacgctggca gccagggcat cggcggcgac gggggcaacg gcggcgacgg cggtaaaggc 2940 

ggtgacggcg ccgacgctgt caacgtcgta ttcatgccga ctgagccaca ggccgcgacc 3 000 

ggcactgccg gcagcgccgg tgaccccacc ggcggtaacg gagggcccgg cactcccggc 3060 

agccccatgg ttgccccgcc cccgccaacg ccaatcactc aagtccaaca gggcggtgac 3120 

35 ggtggcgccg ggggcaccgg atccaccaac gccaacgacg gcacagccac cggcggaaag 3180 

ggcggagaag gcggagtcgg cagcattctc ggcgggcccg gcggcaacgg cggaactggc 3240 

ggcaacgcct cggcaaccgg caccaacggg gtggccaacg ccgggaatgg cggcaagggt 33 00 

ggcgacggcg gccagtttgg ggccggcggc aacggtggtg ccggcggcag" cgtaaccgac 33 60 

ggatccgccg gcagcaccgc aggcaacggc ggcaacggcg gcaacgcaac caacggcacc 3420 

40 atcgcaggcc aacccgccgg cggcaacggc tcggccggcg ggaaaggcgg cgacggcggc 3480 

aacatcgccg ccggtgccac cggcaccgcc ggcaacggcg ggaacggcgg caacggcaac 3 540 

gacggcgccg tcaacgccgg caccggcggc tccggcggga acggcggtaa cgccggtggc 3600 

ggcggcgcca atggcggcga cggcggcgcc ggcggcgccg gcggggccgg cgggcgtggc 3660 

ggcaagggca tcgacggcgg gttcggcggt gacggcggca acggcggcag caacaacggc 3720 

45 accggcgccg gtggcaacgg cggcaacggc ggcaccggcg gggtcggctc ggttggcgcg 3 780 

gctggtggcg atggcggcaa cggcggcacc ggcggcttcg ccggtttcgg cggcaccgca 3840 

ggcaatggcg gttccggcgg cacgggcggg gccggcggcg acggcggcac cggcggggac 3900 

ggcggcaacg gcgttatcgc cggcggcggg gggaccggcg gcaacggcgg cgccagcggg 3960 

gccggcggcg ccggcggcac gggcgggttc gccggcaacg gcaatgccgg cggcaatggc 4020 

50 ggcaccggcg gcgcgagcga ggacggcgac aacggcaacg ctggcagcgg cgccaccggc 4080 

ggtaccggcg gcaacggcgg caccggcggc gacggcggcg ctgccgggct gggcggcgtc 4140 

gcgtga 4146 
<212> Type : DNA 
<211> Length : 4146 

55 SequenceNarae : SEQ ID 522 

SequenceDe script ion : 

Sequence 



60 <213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

atgtcgttcg taatcgcgac gccggagatg ctgaccacgg cggcaacgga tttggcgaaa 60 

attggttcga cgatcactgc ggccaacacc gcagcggccg cggtggcgaa agtgctgccc 120 

gcgtcagccg acgaggtgtc ggtggccgtt gcggcgttgt tcggcacgca cgcccaggaa 180 

65 tatcagaccg tcagcgccca agtggcgacg tttcatgacc ggttcgtgca gaccctgtcc 240 

gcggccgcga gctcgtacgt ggccgccgag gcggtcaacg ttgaacagag tttgctagcc 300 

gcggtcaatg cgcccaccca ggcgctgttc ggacgcccgc tgatcggcaa cggcgccgac 3 60 
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ggctcccccg ggaccgggca ggccggcggg 
aacggcgggt ctggggcgcc aggacaacga 
ggtaacggcg gcaacggtgg agccggcggc 
ggcggcgcgg gcgggtggct gtatggcaac 
5 gcggtcggcg gcaacggcgg ggccggcggt 
ggtggggccg gaggcaacgg catcgccggt 
ggatccggca ccgctggcgg ggccggcggg 
ggcggggtgc tgatgggcaa cggcggcaac 
ggcgccggcg gtgcgggcgc cagcggcgcc 

10 gccggcggaa acggcgggaa tggtggggct 
ggcggccacg gtctgctcgg cctcggcggc 
ggcggtgacg gcggagctcc cggtgacggc 
aaccttggtg ccggcgggac cggtggcaat 
ggcgccggtg. gcgccagcgt cggcggcagc 

15 acctccacca gcggtggtaa cggcggcgac 
ggacagaccg gcgcgaacgg tggcaggggt 
gccggtgggg ccggtggccg cggcggggcc 
ggtcgccccg gcggggccgg cggcaccggc 
ggtgacggcg gcaccggagg tgctggcggg 

20 ggcagcatcg gcttcaatgc gtccgctccc 
ggtaacggtg ggcctggcgg agccggcggc 
gcctcgggcc agaacggcag ccagggcgct 
gggactcctg gcaacggcgg tcacggcgcc 
ggtggcgccg gtggccacgg cggtgatccc 

25 agcggctcca cccccggtgc caacggcgca 
gg c gg caac g gcggcagagg cgccgatgcc 
ggcaggggcg gtgatggcgg tttggtcggc 
ggcagcaaag gcctgcccgg tctgggcagg 
ggcggaaacg gtggggccgg tggatccggc 

30 ggggccggcg gcaccggcgg cgtcggcggg 
ggttccagcg cgggcgcgga cgggcacccc 
ggaaaaggtg gggacggcgg ggacggcggg 
ggccccggcg gtgccggcgg cgacggtggg 
ggcattgacg gcgccgacgg cgccaccgcg 

35 ggggccggcg gaaagggcgg ccgaggcggg 
ggcacaaccg gcagccaagg cgccggcggc 
cccggtgacg gcggcaacgg cgctaacggc 
aacggcggca acggcggcaa cgccggcccc 
tccactfctcg gcgcgaccgg ctcaagcagc 

40 aacggcggca acggcgacca tgccctcagc 
aacggcggca acggcagcct gcgcggcagc 
ggcaatgcca gcaggggcat gggcggtgac 
gggcagattg gcaacggcgg agcgggcggc 
ggcaatcccg gagccattac cggcagcggc 

45 caaggtggga gcgtcgccgg cgacggcgct 
ggcgggacgg gcctgcgggg caccaccggc 
ggcgcggacg gtcacggcgg taacggtggc 
ggcggcggcg gcggcaacgg cggggccggc 
ggcagccaag gcgccggcgg agatggtggg 

50 ggcggcgacg gcggcagagg tgcccacggc 
gggactggcg gaaatggcgg caccggcggc 
ggcgggaccg gttccaccct gggcgcgacc 
aacggtggcg tcggcggcag tggcggcctt 
ggcatgggcg gcgccggcgg caccagcacc 

55 ggcttcggcg gcgacggcct tgatgccagt 
ggtggcgacg gcttcaggac cgctggggcg 
gccgaccccg gcggtttatt cccgatccct 
accggcggca ccgcacacct cgggcccctg 
cagttcggca gccccggcgc cgacggccgc 

60 ggcgccggcg gcagcttcta g 
<212> Type : DNA 
<211> Length : 3921 

SequenceName : SEQ ID 523 
SequenceDescription : 

65 

Sequence 



ccaggcggca tctfcgtacgg caacggcggc " '£20? 

ggcggggccg gcggggcggc gggcctaatc 48 0 

gtgggtacca ccggcggggc cggtggtcac 540 

ggaggcgccg ggggttttgg cggggccggg 600 

accgccgggt tgttcggtgt cggcggggcc 660 

gtcacgggta cgtcggccag cacaccgggt 720 

atcggcggca acggcggggc cggcggggcc 780 

ggcggggccg gcggcgaggg cgggcccggc 840 

cacgccacca acttgggcgc tgacggtcaa 900 

ggcgggaccg gcggggtcgg cgggcccggc 9 60 

agccacggcg ccggcggggc tggcggtagc 102 0 

ggcaacggcg ctaccgggac gtggggtcac 10 8 0 

ggcggcaacc cgggcgccgg tggggccggt 1140 

gcgcatggcg cgaacggogc gcccggcacc. 12 00 

ggcggcaaag gcgccgacgc tatcagcagc 12 60 

ggcgacggcg gtcaggtggg taatggcggt 13 2 0 

ggcggtctcg gatttggatc cgaagcgccc 13 80 

ggggccggcg gcaacggcgg aactcaggcc 1440 

gccggcggcg atggcggatc cggaggtgct 15 0 0 

ggggccgcag gctcacccgg cgggaatggc 1560 

gagggcgggg ccggcggtct cgcgttggcc 162 0 

ggcggtgacg gcggagccgg cgggaacggc 168 0 

gccggcgcgc tcggtgtcaa cggtggtgta 1740 

ggtgtcggcg gtgccggcgg ccagggcgga 18 0 0 

cccggcaaca cccccaccag cggcggcaac 1860 

accggctttg gccagaccgg cgcgtccggc 192 0 

aacggcggcg ccggcggcgc cggcggcaac 1980 

ctcggcaacc ccggcctgga tggcggcacc 2 040 

ggcgcctggg cgggcaacgg tggcaccggc 2100 

actggagggt ccggcagcga cggtgtcaac 2160 

ggcggcaccg gcggagttgg cggtaccggc 2220 

gccgcaccca acggcgtcgc cggcagccaa 2280 

accggcggag ttggcggtaa cggcggccgc 2340 

ggtgcccgcg gccaggatgg cggcgccggc 2400 

accggcgggc cgggcggggc cgggccggcg 2460 

aacggcggca gcggcggcac cggcggggac 252 0 

agcgtgttca ccaacaacgg catcggcggc 2580 

agcggggccg gcgggagtgg cggcgccggc 2640 

agcattcatg tcaacggcgg caacggcggc 270 0 

ggcaacggcg cggccggcgg caacggcggc 2760 

ggtggggccg gcggccacgg cggcaacggc 282 0 

ggcggaaccg gtggcgccgg cggaaacgcc 288 0 

aatggcggag acggcggcac cggtagtgac 294 0 

ggccgcggcg gcgacggcgg cgtaggcggg 3 00 0 

gacggcggcc ggggcggggc cggcggcacg 3 060 

gccaccggtg cgaccggcac attcgacgct 3120 

accggcgggg ttggcgggac cggcggcgcc 3180 

ggcaaagcgc tgtcgccaac gggcaacaac 3240 

gccggtggcg ccggcggcac cggcggcacc 33 00 

actctcttca gcagcctcgc tggcactggc 3 3 60 

accggcggca ccggtggtgc cggcggtgcc 3420 

ggtgcgacag gggcggccgg ccgcgccgga 3480 

ggctccgcct ttggccccgg cggcaccggc 3540 

gtcagcgccg gcggtgacgg cggaaggggc 3 60 0 

tccggtggga atggcgggga cggcgggcac 3 660 

ggcgggcgag gtggggacgg cggcaaaggg 3720 

ggggctggcg gtaagggtgg caccggtggc 378 0 

gccatcatcg gccaatccgg ccagcccggc 3 840 

ggcggggccg ggggcgcagg cgggggcggc 390 0 

3921 
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<213> OrganismName : Mycobacterii 
<400> PreSequenc eSt ring : 
atgtcggcgg ccgcggtcgc ctgggaccaa 
tctttcaact ccgtgacgtc gggcctggtc 
5 gcgatggccg ctgcggtagc gccgtaccta 
cagcggtcgg caacccaagc cgcggccctc 
atggtgcaac cggcgctggt ggcggccaac 
aacttcttcg ggcagaatgc tccggcgatc 
tgggccatcg atgtgtcggt gatgtcggcc 
10 gccctgacgc cgttcactgc gccgccgcag 
gccgctccgg cggccgtcgt caccgcggcg 
cttagcttag gcctggcaaa ctcgggcttc 
ttgaacttgg gcagcctaaa tcccggcggc 
aacaacgttg gcttgggcaa caccggcaac 
15 ggaaacatcg gcttcggcct caccggcgac 
gggaccggca atatcggctt gttcaactca 
ggcaccggaa acttcggcat cgggaactca 
acgggccaag ccaacacggg cttcttcaac 
acgggcaact acaacacggg cagcttcaat 

20 acgggcagct ccaacacagg cttcttcaac 
accggcaacg tcaacaccgg tggattcaac 
cgaggcgact accagggctt gatcggcttc 
ctagacctca acggcctcgg ctccgtcggc 
gaaatcggcc tgggcattaa cagttccgga 

25 attactgttc ccgccatcgg gctgggcatc 
aacatcccgc cgatcactct aaattctatt 
aacgtgggat cgatttcgat ccccgcgtct 
cccaccgttg gcagcatagg cccgggtagc 
cccgcgattc cgccattttt ccccggattc 

30 atcggtccca tcaccttctt atcgggtggt 
gatgtgggtg gagggttagg cccgtttacg 
atcccgttgg gtattgatgt gggcggaggg 
tactcacttc cggcaat ccc gttgggtatt 
gtgttcccgg atggctactc acttccggca 

35 atcggccccc tcactacccc gccaatcacg 
tccggcagcc tcgggccgat caacatcccg 
aactcgacca ccaccccgtc atcgggcttc 
ttcgggaacg tcggttcggg cggatctggc 
tccggattcc ttaacgtcgg gccactgaca 

40 tcaggcctct acaacaccag cacgctgggc 
ggtaacaccg acagccaact cgccggcttc 
aacttcggct tcgccaacga cggcacactc 
aacgtgggta gcggaaacgt cggtagctac 
agtttcggtt tcggaaacat cggaagcaac 

45 aaccttgggt ttgcaaacac gggtccgggg 
gggaacatcg gcggcaacaa ctacggcttc 
ggcaacacgg gcactggaaa tattggtatc 
99ggcgctga actccggcag cggcaacatc 
ggcttcttca actcaggcaa cggaaacgtc 

50 ggcctgggta acgtgggcaa cgccaacacg 
ggaatcggca acgcaggaag ctacaacaca 
ggcgacctca acccgggcaa cgccaacacg 
ggctggggaa acattggcga ccttaacacc 
ggcatactgt ggaggggcga ttaccagggt 

55 cccgccatcc cactgagcgt cgaagtgaat 
attactattc ctggtattcc gttgagcctg 
gtgccggata ttactattcc tggtattccg 
ccgattgtgg tgccggatat tactattcct 
ggtgtcggtc cgattgtggt gccggatatt 

60 gcgctgggtg gtgtcggtcc gattgtggtg 
agcctgaacg cgctgggtgg tgtcggtccg 
atccccctta cgattaacat caggataccg 
aacgtcgctg gtatcttcac gggctacatc 
ggcgtcacgc tggccggcgg caccctggag 

65 ttcggtttga atattccgct gagcggtgct 
attaatccgt ttgggttgaa tgttccgttg 
ggtttcgcga ttaatccgtt tgggttgaat 



im tuberculosis H3 7Rv 

ctggcgatgg aattggcctc ggcagcggcc 60 

ggcgaatcgt ggctcggacc gtcatcggcg 12 0 

ggatggcttg ccgcggcagc ggcccaggcc 18 0 

gtggccgagt tcgaggctgt ccgggcggcg 240 

cgctccgacc tggtgtcatt ggtgttctca 3 00 

gctgcgattg aggccgcata cgaacagatg 360 

taccatgccg gggcatcggc ggtggcgtcg 420 

aatctgacag acctgcccgc ccagttggcg 480 

atcaccagtt ccaagggtgt gctggcaaat 540 

ggacagatgg gcgccgctaa ccttggcatt 600 

aacaacttcg gccttggaaa tgtcggcagc 660 

ggaaacatcg gcttcggccia cacgggcaac- 72-0 

aaccagcagg ggttcggcgg ctggaactcg 78 0 

ggcaccggca acatcggcat cggcaatacg 840 

ggtaccagct acaacacggg tatcggcaac 90 0 

gccggcatcg ccaacactgg catcggcaac 960 

ctaggcagct tcaacacggg cgacttcaac 102 0 

cccggcaacc tcaacaccgg cgtgggaaac 108 0 

tctggcaact acagcaacgg cttcttctgg 1140 

tccggcacac tgaccattcc cgctgctggc 12 0 0 

cccatcacca tcccgtccat caccattccc 1260 

gcgttggtgg ggccgatcaa tgttccgcct 1320 

aacagtaccg gggcactcgt tggccccatc 13 8 0 

ggcctagagc tatcggcgtt ccaggtcatt 144 0 

ccgcttgcga tcggcttatt cggcgtaaat 1500 

atatcgatac agctaggcac tcctgagatt 1560 

cctccagatt acgtgacagt gagtggtcaa 162 0 

tatagtttgc cggctattcc gttgggtatt 168 0 

gtgttcccgg atggctactc acttccggca 1740 

ttaggcccgt ttacggtgtt cccggatggc 180 0 

gatgtgggcg gagggttagg cccgtttacg 1860 

atcccgttgg gtattgatgt gggcggcgcc 1920 

atcccctcaa tcccgttggg catcgacgtg 1980 

atcgaaatcg cgggcacccc aggcttcgga 2040 

ttcaacagcg gtacgggcgg cacatcgggt 2100 

ttctggaaca ttgctgggaa tctcggcaac 2160 

tcgggaatct tgaacttcgg caacacagtc 222 0 

ctagcgacat cggcctttca ctccggcgtc 2280 

atgcgcaacg ccgcaggtgg gacgttattc 2340 

aacttgggca acgcaaacct cggcgactac 240 0 

aacttcggta gcggaaacat cggcaacgga 246 0 

aacttcggtt ttggaaacgt cggcagcaac 252 0 

ttgacggagg ccctgcacaa cattggcttt 258 0 

gcgaacatcg gtaacggcaa catcggcttc 2640 

gggctcaccg gcgacaatca ggtcggattc 270 0 

ggcttcttca actccggcaa cggcaacatc 2760 

ggcatcggca actccggcaa ctacaacacc 282 0 

ggcctgttca acaccggcaa cgtcaacact 2880 

ggcagctaca acgccggcga caccaacacg 2940 

gggtacctaa acctcggcga cctcaacacc 3000 

ggcgccctga tctcgggcag ctacagcaac 3 060 

ctgattggct actcagacac actcagcatt 312 0 

ggtggcatcg gtccgattgt ggtgccggat 318 0 

aacgcgctgg gtggtgtcgg tccgattgtg 3240 

ttgagcctga acgcgctggg tggtgtcggt 33 00 

ggtattccgt tgagcctgaa cgcgctgggt 3 360 

actattcctg gtattccgtt gagcctgaac 342 0 

cctgatatta ctattcctgg tattccgttg 3480 

atcaccgttc ccggcgtccc tatttcccgc 354 0 

gtcaacatca ctctcaacga acttccgttt 3 60 0 

ggccccatcc cgcttagcac attcgtatta 3660 

tctggcatcc agggattcag tgttaatccg 3720 

accaacgctg tcacgatccc tggtttcgcg 378 0 

a 3cgggggca cgagcccggt tacgatccct 3 84 0 

gttccgttga gcgggggcac gagcccggtt 3900 
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acgatccccg gcttcaccat tcccggatcc cccctgaact tgaccgccaa cggcggcttg " 3 960 

ggaccgatca acatcccgat eaacatcacg agcgccccgg gcttcggaaa ctccaccacc 4020 

accccgtctt cgggcttctt caacagtggc gatggaagcg catccggctt cggcaacgtc 4080 

gggcccggca tttcgggcct ctggaaccag gtgccgaacg cgctgcaagg cggagtctcg 4140 

5 ggaatctaca acgtcgggca gctggcgtcg ggcgtggcga acctaggcaa caccgtctcg 42 0 0 

ggcttcaaca acacgagcac cgttggtcac ctcaccgctg cgtttaactc gggcgtcaac 4260 

aacatcggcc aaatgctcct gggcttcttc tcaccgggtg ccgggccgta a 4311 

<212> Type : DNA 
10 <211> Length : 4311 

SequenceName : SEQ ID 524 
SequenceDe script ion : 

Sequence 
15 

<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> Pre Sequenc eSt ring ; 

atggagtttc cggtgttgcc accggaaatc aactccgtgc tgatgtattc gggtgcgggg 60 

tcgagcccgt tgctggcggc ggccgcggcg tgggatgggc tggctgagga gttggggtcg 12 0 

20 gcggcggtgt cgtttgggca ggtgacgtcg ggcctgacgg cgggggtgtg gcagggtgcg 180 

gcggcggcgg cgatggcggc cgcggcggcg ccgtatgcgg ggtggttggg ttcggtggcg 240 

gccgcggccg aggcggtggc cgggcaggcg cgggtggtgg tgggggtctt tgaggcggcg 3 00 

ttggcggcga cggtggatcc ggcgctggtg gcggccaacc gggcgcggct ggtggcgttg 3 60 

gcggtgtcga atctgttggg gcagaacacg ccggcgatcg cggccgccga ggccgagtac 420 

25 gagctgatgt gggccgccga tgtggcggcg atggccggct accattccgg cgcgtcggct 480 

gctgccgcgg cgttgccggc gttcagccca ccggcgcagg cgctgggggg aggtgtcggc 540 

gcgttcctta ccgccctgtt cgccagccct gcgaaggcgc tgagcctgaa tgcgggtttg 600 

ggcaatgtcg gcaattacaa cgtcgggttg ggcaatgtcg gggtgttcaa cctgggcgcg 660 

ggcaatgtgg gtgggcagaa tctgggtttc gggaatgccg gtggcaccaa tgtcgggttc 720 

30 ggcaacctcg gtaacgggaa tgtcgggttc ggcaactccg gtctgggggc gggcctggcc 7 80 

ggcttgggca atatcgggfct gggcaatgcg ggcagcagca actatggttt cgcaaacctg 840 

ggtgtgggca acatcggttt cggcaacacc ggcaccaaca acgtcggcgt cgggctcacc 900 

ggcaaccacc tgacgggtat cgggggcctg aattcgggca ccgggaatat cgggttgttc 960 

aactccggca ccgggaatgt ggggttcttc aattcgggga ccgggaactt cggggtgttc 1020 

35 aactcgggta attacaacac cggtgtcggt aatgcgggga cggccagcac ggggttgttc 1080 

aatgccggca atttcaacac cggcgtggtg aacgtgggca gttacaacac cggcagtttc 1140 

aacgccggcg acaccaacac cggtggcttc aaccccggcg gtgtgaacac cggctggctg 1200 

aacaccggca acaccaacac cggcatcgcc aactcgggca acgtcaacac cggcgcgttc 1260 

atctcgggca acttcaacaa cggcgtgctg tgggtgggtg actaccaggg cctgttcggc 1320 

40 gtctccgccg gctcgtcgat ccccgcaatt cccatcggcc tggtgctcaa cggcgacatc 1380 

ggcccgatca ccatccagcc catcccgatc ctgcccacca tcccgctcag cattcaccaa 1440 

accgtcaact tgggcccgct ggtggttccc gacatcgtga tccccgcctt cggcggcggt 1500 

atcggcatac ccatcaacat cggcccgctg accatcacac ccatcaccct gtttgcccaa 1560 

cagacatttg tcaaccaatt gccctttccc accttcagtt tagggaaaat cacaattcca 1620 

45 caaatccaaa cctttgattc taacggtcag cttgtcagct ttatcggccc tatcgttatc 168 0 

gacaccacca ttcccggacc caccaatcca cagattgatt taacgatcag atgggatacc 1740 

cctccgatca cgctgttccc gaatggcatc agtgctcccg ataatccttt ggggttgctg 1800 

gtgagtgtgt cgatcagtaa cccgggcttt accatcccgg gatttagtgt tcccgcgcag 1860 

ccgttgccgt tgtcgatcga tatcgagggc cagatcgacg ggttcagcac cccgccgatc 192 0 

50 acgatcgatc gcatccccct gaccgtgggg ggcggggtca cgatcggccc catcacgatc 1980 

cagggccttc atatcccggc ggcgccggga gtggggaaca ccaccacggc cccgtcgtcg 204 0 

ggattcttca actccggtgc gggtggggtg tcgggtttcg gcaacgtcgg cgcgggcagc 210 0 

tcgggctggt ggaaccaggc gccgagcgcg ctgttggggg ccggttcggg tgttggcaac 2160 

gtgggcaccc tgggctcggg tgtgctcaac ctgggctcag ggatctcggg gttctacaac 222 0 

55 accagcgtgt tgcctttcgg gacaccggcg gcggtgtcgg gcatcggcaa cctgggccag 228 0 

cage tgt egg gggtgtcggc ggegggaace acgctgcgct cgatgctcgc cggcaacctc 2340 

gggttggcca atgtgggcaa cttcaacacc gggttcggaa atgtcgggga cgtcaacctg 240 0 

ggtgcggcca acatcggtgg gcacaacctg ggcctgggca atgtcgggga cggcaacctg 2460 

gggttgggca acatcggcca tggcaacctg gggtttgcca acttgggect gaccgccggc 252 0 

60 gcggcggggg tgggcaatgt tggttttggc aatgccggca tcaacaacta tggcttggcg 2580 

aacatgggtg tgggcaatat tgggtttgcc aacaccggca egggcaacat egggateggg 2640 

ctggtcgggg accatcggac egggateggg ggcttgaact ccggcatcgg caatateggg 2700 

ttgttcaact ccggcaccgg caaegteggg ttcttcaatt ccgggaccgg caacttcggc 2760 

ategggaact ccggccgctt caacaccggg ateggtaata geggaaegge cagcaccggg 282 0 

65 ctcttcaatg ccggcagctt cagcaccggc atcgccaaca ctggtgacta caacacgggc 288 0 

agcttcaacg ccggcgacac caacaeeggt ggcttcaacc egggeggcat caacaccggc 294 0 

tggttcaaca ccgggcatgc caacaccggg ttggccaacg cgggcacctt cggcaccggc 3000 
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gccttcatga cgggcgacta cagcaacggc 
gtcggcgtcc gcgtcgggcc cacgatctcc 
ggggtgggcc cgctgcatgt ggcgcccgtc 
gacgccaccg tcggcctggg tccgttcacc 
5 atcgccagca tcaccggaag cgtggacctg 
cttgacccgc tcgccggttc gatagggctt 
tttatcacca ttgatgcgtt ccaagttgtt 
gtgcccggcc tcacggttag cggtcagata 
ctcaacttgg acaccacccc gtggacgctt 

10 acccccgtga cggtgggtat ggaggtcgcc 
ctgacctttc cgcgggcctc cgccggggtc 
acgctgttgc ccgacgggtt caccctcgac 
ctcatcggcg atatcccgat cccgatcatc 
accaccacgg ccccatcgtc ggggttcttc 

15 gccaacgtcg gcgcgggcac gtcgggctgg 
gcgggctcgg gagttgccaa tgccggcacg 
gggatctccg ggtggtacaa caccagcacc 
ggcatcggca acctcggcca gcagctgtcg 
cggagcccca ttgtcaatat cgggtgggcc 

20 aatgtggggg acctcaactg gggtgcggcc 
aatctcggca gcgggaacgt cgggttcggc 
aactcgggtc cggcggtggg cctggccggc 
agcaacaact gggggctggc caacctgggt 
acgggcaaca tcgggatcgg gctggtcggc 

25 tcgggtagtg gcaatatcgg attgttcaat 
accggcaccg gcaacttcgg actgttcaac 
agcggaaccg gcagtactgg gctcttcaat 
cccgggtcgt acaacacggg cagcttcaat 
ccgggcgaca tcaacaccgg ctggttcaac 

30 accggcgccc tcatgtcggg gaccgacagc 
ggcctgttcg gcctgtccta tggcatcacg 
actggcggta tcggccccat cgtcatcccg 
cagatcaccg gcgacgcgga ctacagcttc 
cacatcggca tcaatggcgt cgtcaccgtc 

35 tccgccctga agaataacgg tagcttcatc 
gatattccgc ccatggattt cacgttaggc 
ctcggaccaa ttcatcttga gccaatcgtg 
gagcccatcc ccctggatgc gatttcgttg 
gatattccgg cctcggtcat cgatgggatt 

40 tccgtggaca tcccggcggt cacgatcaca 
ttcgacattc gcaccagtgc cggacccctc 
ccgggcttcg ggaactcgac ccagatgccg 
ggcggatcgg gcatcggcaa cttgggtgcg 
gcggggtcac tggtggggac actctcgggg 

45 gtgctgaact ccggcaccgc catctccggg 
accccggcgg tgatctcggg gttcagcaac 
gatggcctga tcgcgatccfc caccttccca 
gacgcggcca tcgccgagct gcagcacctc 
gtcggcgggg tgaacctcgg tttggctaac 

50 gtcggcaaca tcaacgtcgg cgccggcaac 
gtcgggaccg gcaacctcgg gttcggcaac 
gcgggcctga ccgcgggcgc ggggggcctg 
ggcagctggg ggttggccaa cgtgggtgtg 
ggcaacatcg ggatcgggct gaccggggac 

55 ggcaccggga acctcgggtt gttcaactcg 
gggaccggga acttcgggct gttcaactcg 
. ggcacggcca gcaccgggtt gttcaacgcg 
ggctcctaca acaccggcag cctcaacgtg 
ggcaccgtca acaccggctg gttcaacacc 

60 ggcaacgtca acaccggcgc gttcaactcc 
ggtgactacc acgggctggt cggcttctcc 
ctggacctca acgaaaccct caacctgggc 
ggcatgtcgc tgttcgacgt ccacgaaatc 
gtcgatgttc ccgcgatacc gctagagatc 

65 ctggtgcccg ccaccacaat tcccgcacag 
tcacccgggt caaccatgac gcttccgctc 
atcctcgggt cgaccgcggc gattcccaat 



ctgttgtggc ggggcggcta cgagggcctg 3 060 

caattcccgg tcaccgtgca cgcgatcggc 3120 

ccggtacccg ccgtgcacgt cgagatcacc 3180 

gtcccaccga tcagcattcc ctcacttccc 3240 

gccgcaaaca ccatctcgcc gattcgcgct 33 0 0 

tttctcgagc cgttccgcct cagtgaccca 33 60 

gccggtgtct tgttcctaga gaacatcatt 342 0 

ttggtcaccc cgacaccaat tcccctaacc 348 0 

ttcccgaatg gtttcaccat tcccgcgcaa 3 540 

aacgacgggt tcaccttctt cccgggtggg 360 0 

accggactgt ccgtggggct ggacgcgttc 3 660 

accgtgccgg cgaccttcga cggcaccatc 372 0 

gatgtgccgg cggtgccggg gttcggcaac 3 780 

aacaccggcg gcggcggtgg atcggggttc 3840 

tggaaccagg ggcacgacgt gttagcaggg 39 0 0 

ctgagctcgg gcgtgctgaa cgtcggctcg 3960 

ctgggagcgg gcaccccggc ggtggtctcg 402 0 

gggttcttgg caaatgggac cgtgctcaac 408 0 

gatgtgggcg cgttcaacac cgggttgggc 4140 

aacatcggcg cgcagaacct gggcctgggc 42 00 

aacatcggtg ccggcaacgt cgggttcgcc 42 60 

ctgggcaacg tggggttgag caatgccggc 4320 

gtgggcaaca tcgggttggc caacaccggc 43 8 0 

gactaccaga ccggcatcgg cggcctcaac 4440 

tccggcaccg gcaatgtcgg gttcttcaac 4500 

tccggtagtt tcaacaccgg catcggtaat 4560 

gccggcaatt tcaacaccgg catcgccaac 462 0 

gtcggtgata ccaacaccgg tggtttcaac 4680 

accggcatta tgaatacggg cacccgcaac 4740 

aacggcatgc tgtggcgcgg cgaccacgag 48 00 

atcccgcaat tcccgatccg catcaccacg 4860 

gacaccacga tccttccgcc gctgcacctg 492 0 

accgtgcccg acatccccat ccccgccatc 4980 

ggcttcaccg ccccggaagc caccc tgctg 5040 

agcttcggcc ccatcacgct ctcgaatatc 5100 

ctgcccgttc ttggtcctat cacgggccaa 5160 

gtggccggga tcggtgtgcc cctggagatc 5220 

agtgagtcga ttcctatccg catacctgtt 52 80 

tcaatgtcgg aagtggtgcc gatcgatgcg 5340 

ggcaccacca tttccgcgat cccgctgggc 540 0 

aacatcccga tcatcgacat cccggcggcg 5460 

tcgtcggggt tcttcaacac cggtgccggc 552 0 

ggcgtgtcgg gcctgctcaa ccaggccggc 5580 

ctgggcaatg ccggcaccct ggcctcgggt 5640 

ctgttcaacg tgagcacgct ggacgccacc 5700 

ctcggcgacc atatgtcggg ggtgtccatc 5760 

cctgccgagt ccgtgttcga tcagatcatc 5820 

gacatcggca acgctttggc cttgggcaat 5880 

gtcggtgagt tcaacctggg tgcgggcaac 5940 

ctcggcggca gcaacttggg gttgggcaac 6000 

atcggtgccg gcaatttcgg attcggcaac 60 60 

ggcaatgtgg ggttgggtaa cgccggcagc 6120 

ggcaatatcg ggttggccaa caccggcacc 6180 

tatcggaccg ggatcggcgg cctgaactcg 6240 

ggcaccggca acatcgggtt cttcaacacc 63 00 

ggcagttaca gcaccggtgt ggggaatgcg 63 60 

gggaacttca acaccggtct ggccaatgcc 6420 

ggcagcttca acaccggcgg cgtcaacccg 64 80 

ggccacacca acaccggcct gttcaacacc 6540 

ggcagcttca acaacggggc gctgtggacc 660 0 

ttcagcatcg acatcgccgg cagcaccctg 6660 

cccatccaca tcgagcagat cgacatcccc 672 0 

gtcgagatcg gacccttcac catcccgcag 6780 

cacgaatcga tccacatgga tcccatcgtc 6840 

acgagaacca ttccgctgga catccccgcc 6900 

atcagcatgc gcttcgaagg cgaggactgg 6960 

ttcggagacc ccttcccggc gcccacccag 7020 
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ggcatcacca ttcacaccgg ccctggcccc 
ccgggtttcg agattccgca aatcgctacc 
ggtggtctgc cggccttcac cttgttcgcg 
ccgttaacga tcgatgcgtc cggcgcgctg 
5 acgatcgacc cgctgccgct gcacctggcg 
ccgatcatcg atgtcccgcc gacgccaggg 
gggttcttca actccggcgc cggtggggtg 
tcgggctggt ggaaccaggc ggcgagcgcg 
gtcggcacgc tgggctcggg tgtgctcaac 

10 accagcgtgt tgccgctcgg gacgccggcg 
cagctgtcgg gcgtgtctgc ggccgggacc 
gggttggcgg atgtgggcaa cttcaacgtc 
ggcgcggcca acctcggtgc gcaaaacctg 
ggcttcgcca acgtcggcca cggcaatatc 

15 gcggccggcc tgggcaacac ggggttcggc 
aaccagggcg tgcgcaacat cgggttggcc 
ctggtggggg acaacctcac cggcatcggg 
ttgttcaact ccggcaccgg caacatcggg 
atcggtaact cgggcagctt caacaccggc 

20 ctcttcaatg ccggcagctt caacaccggc 
agcttcaatg ccggcgacac caacaccggg 
tggttcaaca ccggccacac caataccggc 
gcgttcatgt cgggcaactt cagcaacggc 
ttcagcctgt tctacagcct cgacgtgccc 

25 ggcggcttcg gacccgtggt cctcccgccc 
accggaaacg tcgcgatggg cgcattcacc 
ccaaacatca ccggaagcgc cgccttccgc 
gtgagtgtca ttgtggagca aataatcaac 
cccttcgaaa tgtggactca aggcactaat 

30 tcggccgacg gttcgcccta cgccaccggc 
ggaagccatc tcaccatttc cgcgtccagc 
ggcccgatca cgttgggctt ccaggtgccc 
ggtggtttga cgttcccggc gacctcgctg 
ggcgtggaca tcccggccat cacctggccc 

35 tatgtcctcg ccagcagcat cccgctgatc 
agcaccatca ccccgtcgtc gggcttcttc 
ggcaacttcg gcgcgggcac ctcgggctgg 
gcgggctcgg gttttgccaa cgttggcacg 
ggtgtctcgg ggatctacaa caccagcacg 

40 ggcctgggca acgtcggcca ccaactgtcg 
ccggtgaccg ttctgaatat cgggttggcc 
aatgtcgggg aggtcaacct gggcgcggcc 
aatatcggcg ccggcaacct ggggttcggc 
aactcgggtc tgaccgcggg cgtgccgggc 

45 ggcaacaact gggggttggc caacgtgggc 
accggcaaca ttgggatcgg gctgaccggc 
tccggtgccg gcaacctggg gttgttcaac 
accgggaccg gcaacttcgg gttgttcaac 
agcggaacgg gcagcactgg gctcttcaat 

50 gccggcagct acaacacggg cagcttcaat 
ccgggcagca tcaacaccgg ctggctcaac 
gcgggcaatg tcaacaccgg cgccttcgtc 
cgcggcgact accagggcct ggccggcttc 
gcggtgggcg ccgacgtcag cggcgggatc 

55 atcccgccca ttccggtcgg cttcgccgcg 
gacatctctg ttccatccat tcacttgggc 
accgtcaacc ccattaccgt caggaccccg 
gtcaccagca cgtccggacc aacctcagag 
atccggatcg cgccctctag cggcgggggt 

60 gggcccatct ccatcccctc cggcacggtg 
ccgatcgaca tcggcctgcc ggtgtcgctg 
accctgatcc ccaccctccc gctgggcctc 
atcccggcca tcgttctcga ccggatcttg 
ccgatcaacg tcccgatcgc cgggttcggc 

65 ctgccgtcgt cgggcttctt caacaccgga 
ggcgcgggca tgtcgggatt gctcaacgcg 
ggcttcgcca acttcggcac ccagctctcc 



ggaacgaccg gcgagctcaa gatatctatt 70 80 

acgagattcc tgttggacgt gaacatcagc 714U 

ggtggcctga cgatccccac gaacgccatc 72 00 

gatccgatca cgattttccc gggtgggtac 72 60 

ctgaatctca ccgtgcccga cagcagcatc 73 20 

ttcggcaaca ccacggcgac cccgtcgtcg 73 80 

tcggggttcg gaaacgtcgg gtcgaacctg 7440 

ctggcggggt cgggatcggg ggtgttgaat 7500 

gtcggctcgg gtgtctcggg gatctacaac 75 60 

gtgctgtcgg gcctcggcaa cgtcggccat 7620 

gcgttgaacc agatccccat cctcaacatc 768 0 

gggttcggca acgtcgggga cgttaacctg 7740 

gggctgggca acgtcggcac cggcaacctc 78 00 
ggtttcggca attcgggtct - gaccgccggc * 7£?60 

aatgccggca gcgccaacta tggtttcgcc 792 0 

aacaccggca ccggcaacat cgggatcggg 7980 

ggcctgaact ccggtgccgg caatatcggc 8040 

ttcttcaact ccgggaccgg caacttcggc 810 0 

atcggcaata gcggaacggg cagcactggg 8160 

gtggccaacg ccggcagcta caacaccggc 822 0 

gggttcaacc cgggcaccat caacaccggc 82 8 0 

atcgccaact cgggcaacgt cggcaccggc 8340 

ctgttgtggc ggggtgatca cgagggcctg 8400 

cggatcacca tcgtggacgc ccacctcgac 8460 

atcccggtgc cggccgttaa tgcgcacctg 852 0 

attccgcaga tcgacatccc cgcactcacc 8580 

atcgttgtgg ggtccgtgcg cattccgccg 8640 

gcctcggttg gggcggagat gaggatagat 87 0 0 

ggccttggta taaccttcta ttcattcgga 8760 

ccactcgttt tcggcgccgg cacgagcgac 8820 

ggggcgttta ccactccgca gctcgaaact 888 0 

ggcagcgtca acgcgatcac cctcttcccc 8940 

ctgaacctgg acgtgaccgc cggcgccggc 9000 

gagatcgcgg cgagcgccga cggctcggtg 9060 

aacatcccgc ccaccccggg cattgggaac 9120 

aacgccggcg cgggcggggg atcgggcttc 9180 

tggaaccagg cgcacaccgc gctggcgggg 9240 

ctgcattccg gtgtgctcaa cctgggctcg 93 00 

ctgggggtgg ggaccccggc gctggtctca 93 60 

gggctgcttt ccggcgggtc cgcggtgaac 942 0 

aacgtcggca gccacaacgc cggtttcggc 948 0 

aacctcggcg cgcacaacct gggcttcgga 9540 

aatattggcc acggcaatgt cggagtcggc 9600 

ctgggcaatg tggggttggg caatgccggc 9660 

gtgggcaata tcgggttggc caacaccggc 972 0 

gactaccaga ccggcatcgg cggcctaaat 9780 

tccggcgccg gcaacgtcgg gttcttcaac 9840 

tccggcagct tcaacaccgg cgtcggcaat 99 00 

gccggcagtt tcaacaccgg tgtggccaac 9960 

gtcggtgaca ccaacaccgg gggcttcaac 10020 

gccggcaacg ccaacaccgg ggtggccaac 1008 0 

accggcaact tcagcaacgg catcctgtgg 10140 

gccgtgggct acaccctccc gctgttcccc 102 00 

ggcccgatta ccgtgctgcc gcccatccac 102 60 

gtcggtggca tcggcccgat cgccatcccg 10320 

ctcgaccccg ccgtccatgt cggctccatc 103 80 

cccgtgctcg tcagttactc ccaaggagcc 10440 

atttgggtca agcccagctt cttccccgga 10500 

gcaacgtcca cgcaaggggc atactttgtg 10560 

accttcccgg gattcaccat ccccctcgac 10620 

accatcccgg ggttcaccat cccgggcggc 10680 

gcgttgtcca atggcatccc gcccgtcgac 10740 

ctggacctgc acgccgacac cactatcggc 108 00 

ggggcgccgg gtttcgggaa ctcgaccacg 108 60 

gctggcggcg gttcgggctt tagcaacacc 1092 0 

atgtcggatc cgctgctcgg gtcggcgtcg 10980 

ggcatcctca accgcggcgc cggcatctcg 11040 
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ggcgtgtaca acaccggcgc gctgggtgtt gtcaccgcgg ccgtcgtctc gggtttcggc 11100 
aacgtcggcc agcaactgtc gggcttgctc ttcaccggcg tcgggcccta a 11151 



<212> Type : DNA 
5 <211> Length .- 11151 

SequenceName : SEQ ID 
SequenceDe script ion : 



525 



Sequence 
10 

<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

ttgaattttc cagttctgcc accggaaatc aactccgtgc tgatgtattc 
tcgagcccgt tgctggcggc ggccgcggcg tgggatgggc t§gct&agga 

15 gcggcggtgt cgtttgggca ggtgacgtcg ggcctgacgg cgggggtgtg 
gcggcggcgg cgatggcggc cgcggcggcc ccgtatgcgg ggtggttggg 
gcgcaggccg tggcggtggc cgggcaggcg cgggccgcgg tggcggcgtt 
ttggcggcga cggtggatcc ggcggcggtg gcggtcaacc ggatggcgat 
gcgatgtcga acctgctggg gcagaacgcc gcagcgatcg cggccgtcga 

20 gagttgatgt gggccgccga tgtggcggcg atggccggct accattccgg 
gctgccgcgg cgttgccggc gttcagccca ccggcgcagg cgttgggggg 
gcgttcctca atgctctatt tgccggaccc gcgaagatgt tgaggcttaa 
ggcaatgtcg gtaattacaa cgtcgggttg ggcaatgtcg ggatattcaa 
gccaatgtcg gtgcgcagaa tttgggtgct gccaacgccg gtagcgggaa 

25 ggcaatatcg gcaacgccaa cttcgggttc ggcaactcgg gtcttgggtt 
atgggcaata ttgggttggg caatgcgggc agcagcaact acggcctcgc 
gtgggcaaca tcggttttgc caacacgggt agcaacaaca tcgggatcgg 
gacaacctga ctggcattgg gggcctgaat tcaggaaccg gtaatctggg 
tccggcaccg gcaacattgg gttcttcaat tcggggaccg gcaacttcgg 

30 tcgggcagct acaacaccgg tgtcggtaat gcggggacgg ccagtaccgg 
gttggtgggt tcaacacggg tgtggccaac gtgggtagct ataacacggg 
gcgggcaaca ccaatacggg tggcttcaac ccgggcaacg tcaacaccgg 
accggcaaca ccaacaccgg catcgccaac tcgggcaatg tcaacaccgg 
tcgggcaact tcagcaacgg tgtgctgtgg cggggtgact acgagggcct 

35 tccggtggat cgaccattcc ggcgatcccc attggtctcg agctcaacgg 
cccatcaccg tgttgccgat ccagattttg cccaccatcc cgctcaacat 
ttcagcctcg gcccgctggt cgttccggac atcgtgatcc ccgcttttgg 
gccataccta tcagcgtcgg ccccatcacc atctcgccca tcaccctgtt 
aacttcaaca cgactttccc cgtcggcccc ttctttggct tgggggtcgt 

40 ggaatcgaaa tcaaagatct tgccggcaac gtcaccctcc aattaggtaa 
gacaccagaa ttaaccagtc attcccggtg accgtcaact ggagtacccc 
atcttcccga atggcatcag tattcccaac aatccactgg cgctgctggc 
atcggcacgc tgggattcac gatcccgggc ttcaccattc ccgctgcgcc 
acgatcgaca tagacggcca gattgacggc ttcagcaccc cgccgatcac 

45 atcccgctga acctcggcgc cagcgtcact gtcggcccta tcctgatcaa 
atcccggcga ccccgggctt tggcaacacg accaccgctc cgtcgtcggg 
tccggcgacg gtggggtgtc gggcttcggg aatttcggtg cgggcagctc 
aaccaggcgc agaccgaggt ggctggggcg ggttcgggtt tcgccaattt 
ggatcgggtg tgctgaactt cggctcgggt gtgtcggggc tgtacaacac 

50 ccgccgggga ccccggcggt ggtctcgggc atcggcaatg ttggtgagca 
ttgtcctcgg cggggacggc actcaaccag agcctcatca tcaatctcgg 
gtgggcagcg taaacgtcgg tttcggcaac gtcggggact tcaacctggg 
atcggcgact tgaacgtggg tttgggcaat gtcggcggcg gcaacgtcgg 
atcggcgatg ccaacttcgg gttgggcaat gcgggtctgg cggcgggcct 

55 ggcaacatcg ggttgggcaa tgccggcagc ggcaacgtcg gcttcggcaa 
ggcaacatcg ggttcggtaa caccggcacc aacaacctcg ggattgggct 
aaccagactg ggatcggcgg cttgaactcc ggtgccggca acatcgggtt 
ggcaccggca acgtcgggtt gttcaactcc gggaccggga acttcgggtt 
ggcagcttca acaccggcat cggcaatggc ggaacgggca gtactgggct 

60 ggtaatttca ataccggtgt ggccaaccct gggtcgtaca acacgggcag 
ggtgacacca acaccggtgg tttcaacccg ggcagcatca acaccggctg 
ggcaacgcca acaccggcgt cgccaattcg ggcaatgtcg acaccggcgc 
ggcaacttca gcaacggcat cttgtggcga ggcaacttcg agggcctgtt 
gtcggcatca cgattcccga attcccgatc cactggactt caaccggcgg 

65 attatcatcc cggacaccac gatccttccc cccatccacc tgggcctcac 
aactacggct tcgccgtgcc ggacatcccc attccggcaa tccacatcga 
gccgccgacg ccggcttcac cgccccggcc accaccctgc tttctgcgct 



gggtgcgggg 
gttggggtcg 
gcagggtgcg 
ttcggtggcg 
tgaggcggcg 
gcgggcgttg 
ggccgagtac 
cgcgtcggct 
tggtgtcggc 
cgcgggcttg 
cctgggcgca 
tttcggtttc 
gccgccgggc 
aaacctgggt 
gttgaccggg 
gttgttcaac 
ggtattcaac 
gttgttcaac 
cagcttcaac 
ctggctgaac 
cgcgttcatc 

gtgggggctc 

cggcgtcggc 
tcaccaaacc 
tggcggtacg 
cccggctcag 
caacatttca 
ccttaatatc 
ggcagtaacg 
cagcgcgtcg 
gctgccgctg 
gatcgaccgc 
cggcgttaat 
tttcttcaac 
gggttggtgg 
cggttcgctg 
cggcgggttg 
gctgtcgggg 
gttggccgat 
tgcggccaat 
gttcggcaat 
ggccggggtg 
catgggtgtg 
gaccggggac 
gttcaactcc 
gttcaactcg 
tttcaatgcc 
cttcaatgtg 
gttcaacacc 
cctcatgtcg 
cggcctgaac 
catcggcccc 
gggacaagcg 
cttcgacggt 
gggcattacc 



60 
,.120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
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ggacaattca ggttcggccc gatcaccgtc 
aacctcaagc ttcagttcct ccacgacgcg 
tcggttcaga tacaggtcgc catacccctt 
ccgctgcagc agaccatcga cgccatcgaa 
5 cccatcgaca ttccgccgat cgacatcccg 
gaggtcgtgc cgatcgatgt gtccgtcgac 
atcgacccga ttccgctgaa cttcgacgtt 
atcatcgaca ttccggcgct gccgggcttt 
ttcttcaaca ccggcggcgg tggcggctcg 
10 ggcttgctga accaggcctc gagtccgatg 
ggcagcctgg catccggtgt gctgaactcc 
agcacgctgg gctccgcgcc ggcggtgatc 
tcgggggtgt ccatcgatgg cctgctggcg 
gggcagccga gcatqatcga cgcggcgatc 
15 atcgtcaacc tgggcaacgt cggcagctac 
aacctgggcg cgggcaacct cggcaacctc 
aacctggggt tgggcaacct cggggacggc 
aatgtcgggt tcggcaactc gggcctgggg 
ggcaacgccg gcagcaacaa cgtcggcttc 

20 ggcaataccg gcaccaacaa cctcgggatc 
ggcggcctga actccggtgc cggcaacctg 
gggttcttca acaccgggac cggaaactgg 
ggcatcggta acagcggaac gggcagtacc 
ggtctggcca atgccggtag ttacaacacc 

25 ggcggcttca accctggcaa tgtcaacacc 
ggcggcttca acacgggcaa tgtcaacacc 
ggcgcgctgt ggaccggtga tcaccacggg 
accggcagca ccctggtgga catcaacgaa 
cagatcgata ttcccggcat gtcgctgttc 

30 ttcaggatcg agcccatcga tgtccccgca 
atcccgccca tcgtcttcct gccgagcatg 
ctcgacacgc ccccggcccc cgccccgccg 
gcgctcggcg acaactggat cgttggggcg 
gtcaccgcac ccactcaggg catcctgatc 

35 agcctcgcac taaccctccc aaccgtcacc 
ctcaagatcg atgtgtcggg cggtcttccg 
atcccgcaaa atgcgatccc gttgaccatc 
atattcccgg gtggtttcac gatcgatccg 
gtgccggaca gcagcgttcc gatcatcatc 

40 accgccaccc cgtcgtcggg tttcttcaac 
aacttcgggg ccggcagctc aggctggtgg 
ggctcgggcg ttctcaacgt tggcacgctg 
atatcggggc tgtacaacac cgctatcgtg 
gccggcaacg tgggccagca gctgtcgggg 

45 agccccatca tcaacctcgg gttggccgat 
gttggggact tcaacctggg cgcggccaac 
atcgggaacg ccaacgtcgg cttcggcaat 
tcgggcctgg gggcggcgct cggcatcggc 
aacgttggcc tggccaacat gggtgtgggc 

50 aacctcggga ttgggctgac cggcgacaac 
gccggcaaca ttggcctgtt caactccggc 
accggaaact gggggttgtt caactcgggc 
acgggcagta ctgggctttt caatgccggt 
tcgtacaaca cgggcagctt caatgtcggt 

55 agcatcaaca ccggctggtt caacaccggt 
aatgtcgaca ccggcgccct catgtcgggc 
aactacgaag gcctattcag ctattcctac 
gacgcgcatt tcaccggggc cttcggcccg 
atcaacgcgc acctgaccgg caacgcggcg 

60 attcccgccc tcaatccgaa cgtcaccgga 
tcggtcacca ttcccgccct gaccgccgca 
ggggcgacca gcgaaataga gccgtttatc 
acgtggtact cggtcggcag aatctacaac 
atctcgggaa tcccgacgct cagcacgacc 

65 tctcaggcgt tcaacacccc ggcgctcaat 
ccgggcagca tcgacgcgat caccctgttc 
ctgctgaacc tggatgtgtt cgtcggcacc 



tcaaacgtcc agctcaatcc gttcaacgtt 3360 

ttcccaaatg aatttcccga tcccacaatc 3420 

acttcggcaa cgctgggcgg attggccctg 348 0 

ttgccggcaa tctcgttcag ccaatccata 3540 

gcctccacta tcaacggaat ttcgatgtcg 3 60 0 

attccggcgg tcaccatcac cggcaccagg 3660 

ctcagcagcg ccggacccat caacatctcg 3720 

ggcaactcga ccgagctgcc gtcgtcgggc 3780 

ggcatcgcca acttcggcgc gggggtgtcc 3840 

gtggggacgc tctccggcct gggcaatgcc 3900 

ggcgtcgaca tctcgggcat gttcaacgtg 3960 

tcgggtttcg gcaacctggg caaccacgtg 402 0 

atgctgacca gcggcgggtc gggcggctcc 408 0 
gccgagctgc ggcaqctgaa tccgctgaac . -4X4 0 

aacctcggct tcgccaacgt cggcgacgtc 42 0 0 

aacctcggcg gtggcaacct cggcgggcag 42 60 

aacgtcgggt tcggcaacct cggccacggc 43 20 

gcgctgccgg ggatcggcaa catcgggttg 43 8 0 

ggcaacatgg gcctgggcaa catcgggttc 4440 

gggctgaccg gcgacaacca gaccgggttc 4500 

gggttgttca actccggcac cggcaacatc 4560 

gggttgttca actcgggcag ctacaacacc 4620 

gggcttttca atgccgggag tttcaacacg 468 0 

ggcagcctca acgcgggcaa caccaacacc 4740 

ggctggttca acgccggcca caccaacacc 480 0 

ggcgcgttca actccggcag cttcaacaac 4860 

ctggtcggct tctcctacag catcgaaatc 4920 

accctcaacc tcggtcccgt ccacatcgat 498 0 

gacatccacg aactcgtcaa catcgggccc 504 0 

gtggtgctgg acatccacga aacgatggtc 5100 

acgatcggcg gtcagaccta cacgattccg 5160 

cccttcagac ttccgttgct gttcgtgaat 5220 

tccaactcaa ccggaatgag tggtggcttt 528 0 

cataccggtc ccagcagcgc aaccaccggt 5340 

atcccaacga tcacgacatc gcctatcccg 540 0 

gccttcacgc tgttccccgg tggcctcaac 5460 

gatgcgtccg gcgtgctgga tccgatcacg 5520 

ctgccactga gcctggccct caacatcagc 5580 

gttccgccga cgcccggctt cgggaacgcg 5640 

tccggcgcgg gcggggtgtc gggtttcggc 5700 

aaccaggcgc atgccgcgtt ggcgggcgcg 5760 

aactcgggtg tgctgaacgt cggctcgggg 5820 

ggtttgggga cgccggcgct ggtgtcgggt 5880 

gtgttggcgg ccgggacggc gttgacccaa 5940 

gtcggcaact acaacctcgg gttgggcaac 6000 

ctcggcgacc tcaacctagg gttgggcaat 6060 

atcggccacg gcaacgtcgg gtttggcaat 6120 

aatatcgggt tgggcaatgc gggcagcacc 6180 

aacatcgggt tcgccaacac cggcaccaac 6240 

cagaccggca tcggcggctt gaactccggt 63 00 

accggcaaca tcgggttctt caactccggg 63 60 

agcttcaaca ccggcatcgg taatagcgga 642 0 

ggtttcacta cgggtctggc caacgccggg 648 0 

gacaccaaca ccggtggctt caacccgggc 654 0 

aacgccaaca ccggcatcgc gaactcgggc 6600 

aacttcagca acggcatcct gtggcggggc 6660 

agcctcgacg ttccccggat caccatcctg 6720 

gtggtcgtcc cgcccatccc ggttctggcg 6780 

atgggcgcct tcaccatccc gcaaatcgat 6840 

agcgtcggct tcggccccat cgcggtcccc 6900 

cgagcggtcc tcgatatggc cgcgtcggtc 6960 

gtctggacgt catccggtgc gatcggccca 7020 

gccggtgacc tgttcgtcgg cggcaatatc 7080 

ggcccggtgc atgccgtctt caatgcggca 7140 

attcaccaga tcccgttggg tttccaggtg 7200 

cccggtggtc tgacgttccc ggcgaactcg 7260 

cccggcgcca ccattccggc gatcacgttc 7320 



WO 2005/076010 PCT/IN2005/000037 

208/341 



ccggagatcc cggcgaacgc cgacggcgaa ctctacgtca tcgccggcga catcccgctg 73 8 0 

atcaacatcc cgcccacccc gggcattggg aacaccacca ccgttccgtc gtcgggcttc 7440 

ttcaacaccg gggcgggcgg gggctcgggt ttcggcaact tcggcgcgaa catgtcgggg 7500 

tggtggaacc aggcgcacac cgctttggca ggcgcgggtt cgggtattgc caacgtcggc 7560 

acactgcact ccggcgtgct caacctcggt tcggggctgt cggggatcta caacaccagc 7620 

acgctgccgc ttgggacgcc ggcgttggtg tcgggcctgg gcaacgtcgg tgatcacctg 768 0 

tcgggcttgt tggcctccaa cgtggggcaa aaccccatca ccatcgtcaa catcgggttg 7740 

gctaacgtcg gcaacggcaa cgtcggcctc ggcaacatcg gcaacctcaa cctgggtgcg 78 0 0 

gccaacattg gcgacgtgaa cctgggattc ggcaacattg gcgacgtgaa cctgggcttc 78 60 

ggcaacatcg gcggcggcaa cgtcgggttc ggcaatatcg gcgatgccaa cttcgggttc 7920 

gggaattcgg gtctggcggc gggcctggcc ggcatgggca atatcgggct gggcaacgcc 798 0 

ggcagcggca acgtcggctg ggccaacatg ggcctgggca acatcgggtt tggcaacacc 8040 

ggcaccaaca acctcgggat cgggctcacc ggcgacaacc agtccggcat cggcggcttg 8100 

aactccggca- ctggcaacat tggcctgttc aactccggca caggcaatat cggcttcttc ai6Q 

aactccggga ctgccaactt cgggttgttc aactccggca gctacaacac cggtatcggc 822 0 

aactccgggg tggccagcac cgggttggtc aacgccggcg gcttcaacac cggtgtggca 82 8 0 

aacgccgggt cgtacaacac gggcagcttc aatgctggtg acaccaacac cggtggcttc 83 40 

aacccaggca gcaccaacac cggctggttc aacaccggta acgccaacac cggcgtcgcc 840 0 

aacgcgggca atgtcaacac cggcgccctc atcacgggca actttagcaa cggcatctta 84 60 

tggcggggca attacgaggg cttggccggc ttctccttcg ggtaccccat tccgctgttc 8520 

cccgcggtgg gcgccgacgt caccggcgac atcggccccg ccaccatcat tccgcccatc 85 80 

cacatcccgt ccatcccgtt gggcttcgcc gcgatcggcc acatcgggcc gatcagcatc 8640 

ccgaacatcg ccatcccctc gatccacctg ggcatcgatc ccaccttcga cgtcggccct 87 0 0 

atcaccgtgg accccatcac cctcaccatc cctggcctaa gtttggatgc tgccgtctcg 8760 

gagatcagga tgacgtccgg aagcagctcc ggattcaagg tcagacccag cttttcattc 8820 

ttcgcggtcg gacccgacgg catgcccggg ggcgaggtct ccatacttca accattcacc 88 8 0 

gtggcaccca tcaacttgaa cccgacgaca ctgcacttcc ccggattcac cattcccacc 8940 

ggacccatcc acatcggcct gccgctgtcg ctgaccattc cgggcttcac catcccgggc 90 00 

ggcaccctga ttccccaact cccgctgggc ctcggtttgt ccggcggcac cccacccttt 90 60 

gatctcccga cggtcgttat cgaccggatc ccggtggagt tacacgccag caccaccatc 9120 

ggccccgtca gcctcccgat tttcgggttc ggcggagcac cgggctttgg caacgacacc 918 0 

accgcgccgt cgtcgggctt cttcaacacc ggcggtggtg gcgggtccgg cttctccaac 9240 

tccgggtcgg gcatgtcggg ggtgctcaac gcgatctcgg atccgctgct cgggtcggcg 93 0 0 

tcgggcttcg ccaatttcgg cacccagctc tccggcatcc tcaaccgtgg cgcgggcatc 93 60 

tcgggcgtgt acaacacggg cacgcttggc ctggtcacat cggccttcgt ctcgggcttt 942 0 

atgaacgtcg gccagcagct gtcgggcctg ctgttcgcgg gcaccgggcc gtaa 9474 

,<212> Type : DNA 
<211> Length : 9474 

SequenceName : SEQ ID 526 

SequenceDe script ion : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgagttttg tcgtaatgcc gccagagatc aactccctgc tgatatatac 
ccgggccccc tgttggcggc ggccgcggcc tgggatgagc tggccgccga 
gcggcggcgg ctttcgggtc ggtgacctcg gggctggtcg gtggtatctg 
tcctcggtcg cgatggcggc ggcggccgcc ccgtatgcgg ggtggttgag 
gcctccgccg agtcggcggc cgggcaggcc cgggcggtgg tgggtgtgtt 
ttggccgaga cggtggaccc cttcgtcatc gcggccaatc ggtcgaggtt 
gcgttatcga atctgttcgg ccagaacacg ccggcgatcg cggccgcgga 
gagctgatgt gggcccagga cgtggccgcg atgctggg^t accacaccgg 
gcggccgagg cgttggcgcc gtttggctcg ccgctggcaa gcctggcggc 
ccggccaagt cgctcgccgt caatctgggt ttggccaacg ttggcctctt 
agcggcaacg tcggcagcta caacgtgggg gccggcaacg tcgggagcta 
ggcggcaata tcggtggcaa caatgtcggg ttgggcaatg tcgggtgggg 
ctcgggaatt cggggttaac gccgggtctg atgggtctgg gtaatatcgg 
gccggcagct acaatttcgg tttggcgaat atgggtgtgg gcaatattgg 
accggcagtg ggaatttcgg tattgggttg accggtgata atctgaccgg 
ttcaataccg gtagcgggaa tgtggggttg tttaattcgg ggaccggtaa 
tttaactccg gcaccgggaa ctggggggtg ttcaattcgg ggagctataa 
ggtaattcgg ggattgccag cacggggttg ttcaacgcgg gtgggttcaa 
gtcaatgcgg gtagctacaa caccggcagt ttcaacgcgg gggaggccaa 
ttcaacccgg gcagtgtcaa cacgggttgg ttgaacaccg gtgacatcaa 
gccaactccg gcgacgtcaa caccggtgcc ttcatctccg gtaactacag 



cggggcgggt 60 

gctgggctcg 12 0 

gcaggggccg 180 

cgcggcggcg 240 

cgaggcggcg 3 00 

agtgtcgctg 3 60 

gttcgactac 420 

cgcctcggcg 480 

cgccgccgag 540 

caacgcggga 600 

caacgtgggc 660 

caactttggg 720 

gtttggtaat 78 0 

gttcgctaac 840 

gttcggtggt 900 

tgtggggttc 960 

caccgggatc 1020 

t acgggtgtg 10 80 

tacgggcggt 1140 

caccggggtg 12 0 0 

caacggcgtc 12 60 
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ttgtggcggg gcgactacca gggcctgctc 
gtcattcccc tcagcctgga cataaacggt 
cacatacttc ccgacatccc gatcaacatc 
gtcccgccca tcaacgtccc ggcaatctcg 
5 ggccccatca aaatcaatcc catcaccctg 
acgctggcct ggcccgtctc gtccataaca 
ccttccccca ttcccacaac cttgatcggc 
ccagtaacct tcagttattc caccccagcc 
cccaccgggg ggccgctcac cctgactctt 
10 attccgggat tctcgattcc cgagcaacca 
atcaacgccc tgagcacccc ggcaatcacc 
atcggcggcg tcgggcccgt cgacatcgtg 
gggaattcga ccaccgcccc gtcctcgggc 
ggattcggga acgtcggcgc gcacacctcg 
15 caggtgttgc cgggaacggt ttcgggctat 
ggcaatgtag gtactcagtt gtcggggatg 
ttcggattgg gcaatatcgg gttcgacaac 
ttcggcctgg cgaacatggg catcggcaac 
atcggtatcg ggctcagcgg ggacaatctg 

20 gagaatgtgg gattgttcaa ttcgggtacc 
gggaatttgg gcgtattcaa ttcgggaagc 
aacatcaacg tcctggcccc cttcaccccc 
attgacctgc aggtgattgg cgggatcggt 
gcattcgaca tacaaatcac cgggggattc 

25 atcaccattc ccgcgatccc gatccacgtg 
gtgaatcccg cgtttgttct tttcgggcaa 
gtcgtcttgc cggacccgtt catcacgatc 
ggcgcgaaat tcccctccgg gtcgttctac 
cccatcatcg gcagctacgg cggcccgggc 

30 aatctgtcca catcgtcctt ggccttgttc 
ccggtgaccg tcaacctgac cggcggcctg 
gcgttcccgg aaaaccccgt ggtcagcctc 
accgtgttcc cgcagggttt cacggttgac 
tctattggcc ctttcccgtt ccggtgggac 

35 attccggcgg tcccaggcgg ctttggcctc 
ctcaacggcg gcatcggccc catcagcatc 
cccctcctga ccgtcacggg aaacctcgag 
atccccgcca tcaattttgg cctcgacgga 
actaccctgc tgtccggcct gggcatcacg 

40 acgaatatac agacccagcc cgcccagctg 
tttgacttca gagacggcat cgaactgaat 
atcaccatgg cgggactcag tatcccgttg 
ttctctttcg gttccccggc atccaccgta 
atcgacgtgt ccatcaacct cgaggacgcc 

45 attccgctga acgtggacgt cacaatcccg 
gagccgggct cagggaactc caccaccacc 
ccgggcctgg gtgtgggact gttgggcctg 
tcgggcttca attccgcggt cgggatcgtc 
ggcggcaatg tcgggttggg caatgtcggg 

50 gggttcaatg tgggtggcgg caatatcggc 
tttggcaatg tcgggttggc gaattcgggg 
atcgggtttg gtaatgccgg cagctacaat 
attgggttcg ctaacaccgg cagtgggaat 
accgggttcg gtggtttcaa taccggcagc 

55 ggtaatgtgg ggttctttaa ctctggcacc 
tataataccg ggatcggtaa ttcggggatt 
ttcaacacgg gtgtggtcaa tgcgggtagc 
gccaatacgg gcggtttcaa cccgggcagt 
atcaacaccg gggtggccaa ctccggcgac 

60 tacagcaacg gcgccttctg gcggggcgac 
cccgccgtgc ttccccaaac gccgttcctg 
gtcgttatcc ccgccatcga cattcccgcg 
atcgacagct tcactgtgcc gagcatcccg 
gtcagcgtcg gcctcggccc catcaccgtc 

65 acgctgaatt acttgtttgg ctcacaaccc 
ggactcttca acacccccat cggccttacc 
gctagctcgt cgcaagggac catcacggct 



ggcttctcct cgggagcgaa cgttcttcct 1320 

ggtgtcggcg ccatcactat cgagcccatc 13 80 

aacgaaaccc tctacctcgg acccttggtc 1440 

ctcggcgtcg gcatacccaa tatttcgatc 15 00 

tggccggcac aaaacttcaa ccaaaccatc 1560 

attccccaaa tccaacaggt cgccctcagc 1620 

ccaatacata tcaataccgg gttttccatc 1680 

ctcacccttt tcccggttgg cctcagcatt 1740 

ggcgtaacgg caggcaccga ggccttcacc 18 00 

cttccgctgg cgatcaacgt gattggccac 18 60 

atcgacaaca tccccctgaa cctgcacgcc 1920 

ggcggcaacg ttccggcgtc gccggggttc 198 0 

ttctttaaca ccggcgccgg cggggtgtcg 2 040 

ggctggttta accagtccac -ccaggccatg ,-2100. 

ttcaactccg gcacgttgat gtcgggtatc 2160 

ttgtcgggcg gcgcgctggg cggcaacaac 2220 

gtcgggttcg gcaatgccgg cagcagcaat 2280 

atcgggttgg ccaataccgg caacgggaat 2340 

actgggttcg gtggtttcaa ttcgggcagc 240 0 

ggcaatgtcg ggttcttcaa ctccggcacc 2460 

cacaacaccg gtttcttctt aacgggcaac 252 0 

ggcacccttt tcaccatttc cgaaatcccc 25 8 0 

cccatccacg tccagcccat cgatattccg 2640 

atcgggatcc gcgaattcac tctcccggag 270 0 

accggaaccg tcggcctcga gggttttcac 2760 

accgccatgg cagaaatcac cgcggacccg 28 2 0 

gaccactacg gcccgccgct aggcccaccc 288 0 

ctcagcatca gcgacctgca gatcaatgga 2 940 

acgatccccg gcccgttcgg cgcgactttc 3000 

ccggccggcc tcaccgttcc cgaccaaaca 3 0 60 

gacagcatca cgctattccc gggcggcctg 3120 

accaacttct ccgtcggcac cggtggattc 3180 

cgcatcccgg tggacctgca caccaccctt 3240 

tacatcccgc caaccccggc caacggcccc 33 00 

accagcgggc tgtttccatt tcacttcacc 33 60 

ccgaccacca ccgttgtgga tgcgctaaat 3420 

gtcggcccct ttaccgtccc ggacatcccc 3480 

aacgtcaatg tgagttttaa cgccccggcc 3540 

ggaagcatcg atatctccgg aatccaaatc 3 600 

ttcatgtcgg tcggtcagac tcttttcttg 3660 

ccgattgtaa ttccgggaag ttctattccc 3 720 

cccactgtca gtgaatccat accgctcaac 378 0 

aagtcaatga ttctgcacga aatcctgcct 3840 

gtcttcatcc cggccaccgt gctgcccgca 390 0 

gtgggcccca tcaacatccc gatcatcacc 3 9 60 

acgtcggatc ccttcagcgg tttggctgtc 402 0 

ttcgacggga gcatcgccaa caacctgatc 408 0 

ggcccgaacg tggggttgag caacctcggt 4140 

gactttaacc tgggtgcggg caacgtcggt 42 00 

ggcaacaatg tcgggttggg caatgtcggg 42 60 

ttaacgccgg gtctgatggg tttgggtaat 432 0 

ttcggtttgg cgaatatggg tgtgggcaat 43 8 0 

ttcggtattg ggttgaccgg tgataatctg 444 0 

gggaatgtgg ggttgtttaa ttcggggacc 45 0 0 

gggaactggg gggtgttcaa ttcggggagt 4560 

gccagcacgg ggttgttcaa cgcgggtggg 462 0 

tacaacaccg gcagtttcaa cgcggggcag 468 0 

gtcaacacgg gttggttgaa caccggtgac 4740 

gtcaacaccg gcgccttcat ctccggtaac 48 0 0 

taccagggcc tgctcggctt ctcctaccgc 4860 

gacctcaccc tcaccggcgg actgggctcc 492 0 

atccgccccg agttcagcgc caacgtcgcc 498 0 

attccccaga tcgacctggc cgccaccacg 504 0 

ccgcacctcg atattccacg ggtgcccgtc 5100 

ggcgggcccc tgaaaatcgg tccgattacg 5160 

cccctggcgt taagccagat agtcatcggt 522 0 

ttcctggcca acctgccgtt cagcaccccc 528 0 
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gtcgtcacca ttgacgagat cccgctgctg gccagcatta ccggccacag cgagcccgtc 5340 

gacatcttcc cgggcggcct cacgatcccc gcgatgaacc cgctgagcat caacctgtcc 5400 

ggtggcaccg gcgccgtcac cattccggca atcaccatcg gcgaaatccc ctttgacctc 54 60 

gtggcccaca gcacgctcgg ccccgttcac atcctcatcg accttcccgc cgtgccgggg 5520 

ttcgggaata cgaccggtgc tccgtcgtcg ggtttcttca actccggtgc gggtggggtg 55 8 0 

tcggggtttg ggaatgtcgg cgcgatggtg tcgggtggct ggaatcaggc tccgtcggcg 564 0 

ttgctgggtg ggggttcggg tgttttcaac gccggcacgc tgcattcggg tgtgctgaat 570 0 

ttcggctctg gcatgtcggg gctgttcaac accagcgtgt tggggttggg tgcgccggcg 5760 

ttggtgtcgg gtttgggtag tgtcggtcag cagttgtcgg gattgttggc gagcgggacg 5820 

gcgctgcatc agggtctggt cctcaatttc gggttggccg atgtggggtt gggcaatgtc 58 8 0 

gggttgggca atgtcgggga ctttaacctg ggtgcgggca acgtcggtgg gttcaatgtg 5940 

ggtggcggca atatcggcgg caacaatgtc gggttgggca atgtcgggtg gggcaacttt 60 0 0 

gggctcggga attcggggtt aacgccgggt ctgatgggtt tgggtaatat cgggtttggt 60 60 
aatgccggca gctacaattt cggtttggcg aatatgggtg tgggcaatat tgggttcgct , 6120 

aacaccggca gtgggaattt cggtattggg ttgaccggtg ataatctgac cgggttcggt 6180 

ggtttcaata ccggcagcgg gaatgtgggg ttgtttaatt cggggaccgg taatgtgggg 6240 

ttctttaact ctggcaccgg gaactggggg gtgttcaatt cggggagtta taacaccggg 63 0 0 

atcggtaatt cggggattgc cagcacgggg ttgttcaacg cgggtgggtt caacacgggt 63 60 

gtggtcaatg cgggtagcta caacaccggc agtttcaacg cggggcaggc caatacgggc 6420 

ggtttcaacc cgggcagtgt caacacgggt tggttgaaca ccggtgacat caacaccggg 648 0 

gtggccaact ccggcgacgt caacaccggc gccttcatct ccggcaacta cagcaacggc 6540 

gccttctggc ggggcgacta ccagggcctg ctcggcttct cctacaccag caccatcatt 6600 

cccgaattca ctgtcgcgaa catccacgcg tccggcggcg ccggacccat catcgttccg 6660 

tcgatccaat ttccggcaat tcccttggac ctcagcgcaa ccggccacat cggcggcttc 6720 

accatcccgc cggtgtccat ttccccgatc acggttcgca tcgacccagt cttcgacctc 6780 

ggccccatca ccgtccagga catcacgatt cccgccctgg gactcgaccc cgcaaccggt 684 0 

gtcaccgtgg gcccgatatt cagctcaggc tccatcatcg atccattcag ccttacgctg 690 0 

ctggggttca tcaacgttaa tgtccccgcc atccaaacgg cgcccagcga gattctgcca 69 60 

ttcaccgtgc tgctgagttc gcttggcgtg acccatctaa caccggaaat caccatcccg 702 0 

ggattccaca tacccgtcga tccaatccat gtcgagctgc ccctgtccgt caccatcgga 70 8 0 

cccttcgtga gcccggaaat caccattccc caactcccgc tgggcctcgc gttgtccggc 714 0 

gccacccccg ccttcgcctt ccctctggag atcaccatcg accgaatccc agtggttctc 72 0 0 

gacgtcaacg cgctgctcgg ccccatcaac gccgggttgg tcatcccgcc cgtcccggga 7260 

ttcggcaaca ccaccgcggt cccgtcgtcg gggttcttca acatcggcgg tggcggcggg 73 2 0 

ttgtcgggct tccacaacct cggcgcgggc atgtcgggcg tgctcaacgc gatctcggat 73 80 

ccgctgctcg gatcggcgtc gggcttcgcc aacttcggca cccaactctc cggcatcctc 744 0 

aaccgcggcg cggacatctc gggtgtgtac aacaccggcg cgctgggcct gatcacctcg 7500 

gccctggtct ccggcttcgg caacgtcggc cagcaactgg cgggcctgat ttacaccggc 7560 

accgggccct aa 7572 
<212> Type : DNA 
<211> Length : 7572 

SequenceName : SEQ ID 527 

SequenceDescription : 



Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<40 0> PreSequenceString : 

atgtcatttg tgattgcggt gccggaagca ttgacgatgg cggcttcgga tctggccaac 60 

attgggtcga cgatcaacgc ggcgaatgcg gcggcggcat tgccgaccac gggggtggtg 12 0 

gcggctgccg cggatgaggt ttcggcggca gttgcggcct tgttcgggtc gtacgcgcag 180 

agctatcagg cttttggtgc gcagctgtcg gcgtttcacg cccagttcgt gcagtccctt 240 

acgaacggcg cgcgctcata cgtagttgcc gaggccacca gtgctgcgcc gttgcaggat 3 00 

ttgttgggcg tggtaaatgc ccccgcccag gcgttgttgg ggcgcccgtt gatcggcaat 3 60 

ggcgctaacg gggccgacgg gacgggggct cccggtgggc cgggcgggct gttgcttggc 42 0 

aacggcggga atggcggatc gggtgcgccg ggtcagccag gtggtgctgg cggggatgcg 48 0 

gggttgatcg gtaacggcgg gactggcggt aaaggtgggg acgggctggt cgggtccggt 540 

gctgccgggg gtgtcggtgg tcgcggtgga tggttgctgg gtaatggcgg gaccggtggg 600 

gctggtgggg ctgcaggggc cactttggtc ggcggtactg gcggtgtcgg tggggcgacg 660 

gggttgatcg gcagcggggg cttcggcggt gctggcgggg ccgcggcggg ggtgggcacc 720 

accggcggcg tgggcgggag cggtggcgtt ggcggcgtgt tcggcaatgg tggattcggc 780 

ggggccggtg gccttggcgc cgccggcggc gtcggagggg cggccagcta cttcgggacc 840 

gggggcggtg gcggcgttgg tggggacggt gcgcccggtg gtgacggcgg tgcgggtccg 90 0 

ctattgatcg gcaatggcgg tgttgggggt ctgggtgggg ccggggcggc cggtggtaat 960 

ggcggtgccg gcgggatgtt gttgggcgat ggcggtgccg gcggacaggg tgggccggcc 1020 

gtggcgggtg tcctgggcgg gatgcccggc gcgggcggca acggcggtaa tgccaactgg 1080 

ttcgggtccg gtggtgccgg cgggcagggt ggcaccggtc tggccgggac aaacggggtc 1140 
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aaccccggct cgattgcgaa ccccaacacc ggtgcgaacg gtaccgacaa cagcggcaac 1200 

gggaatcaaa ctggcgggaa cgggggtccc ggccccgccg gtggcgtcgg cgaggctggc 12 60 

ggcgtcggcg ggcagggcgg gctgggggag tcgctcgacg gcaacgacgg caccggcggt 132 0 

aagggtggag ccgggggtac tgccggtacc gatggcggtg ccggcggcgc tggcggcgct 13 8 0 

5 ggcggcatag gtgagaccga cggcagcgcc ggcggcgtgg ctaccggggg tgaggggggt 1440 

gacggtgcca ccggaggggt cgacggtggc gtgggtggtg ctggcggcaa gggggggcag 15 00 

gggcacaaca cgggtgtagg tgacgctttc ggcggtgacg gcggaatcgg cggtgacggt 1560 

aacggggcac taggcgcggc gggcggtaac ggcggcaccg gtggtgccgg tggaaacggt 162 0 

ggacgtggcg ggatgttaat cggcaacggc ggcgccggtg gggccggcgg gacgggcggc 168 0 

10 accggtggtg gtggcgccgc cggcttcgcg ggcggtgtcg gcggcgcggg cggagagggt 1740 

ctcaccgacg gtgcgggtac cgcggaaggc ggcaccggcg gtctgggggg cctcggcggt 1800 

gtcggcggta ccggcggtat gggtggcagc ggcggtgtcg gcggcaacgg cggggcggct 1860 

gggtcgctca tcgggcttgg tggtggcggg ggtgccggcg gtgtcggcgg caccggtggc 1920 

atcggcggca tcggtggtgc cggcggcaac ggtggcgccg gcggcgcggg taccaccacc . 19S0 

15 ggcgggggag cgacaattgg cggtggcggc ggtacaggcg gcgtgggggg cgctggtggc 2040 

actggcggta ccggcggcgc cggcgggacc accggcggca gcggcggagc cggcgggctg 2100 

atcgggtggg caggagctgc cggcggcacc ggcgcaggcg gcacgggtgg gcaaggtggc 2160 

ctcggcggcc agggcggcaa cggcggcaac ggcgggaccg gcgcgaccgg cggtcagggc 222 0 

ggcgatttcg cgctgggcgg caacgggggc gccggcggcg cgggtgggtc accgggtggc 228 0 

20 agctccggca tccagggcaa tatgggcccg cccggcaccc agggcgccga cggatag 233 7 

<212> Type : DNA 
<211> Length : 2337 

SequenceName : SEQ ID 528 
25 SequenceDe script ion : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 

30 <400> PreSecjuenceString : 

ccccaaggcg ccgacggcaa cgccggcaac ggcggtgacg gcggggtcgg cggcaacggc 60 

ggaaacggcg cagacaacac caccaccgcc gccgccggca ccacaggcgg ggccggcggg 120 

gccggcgggg ccggcggaac cggcggaacc ggcggagccg ccggcaccgg caccggcggc 180 

caacaaggca acggcggcaa cggcggcaac ggcggcaccg gcggcaaagg cggcaccggc 240 

35 ggcgacggtg cactcgcagg cagcagcggt ggtgccggcg gtaaaggcgg caacggcggc 3 00 

gacgccggca aggccggtac cggctccgct cctggcacgg cggggaccgg cggcgatggg 3 60 

ggtaagggcg gcaacggcgg cattggcgct gccggcacaa ccggccccgt aggcaccggc 420 

gcgtccggcg gcaccggtgg tagtggtggc gccggcggaa ccggcggtga cggcggcgcc 480 

gccaacggcg gcaccgccgg ggctggcggg gcgggcggca atggcggcaa aggcggcgac 540 

40 ggtggagcag gcgtcaccag cagcaccgcc ggcaacagcg gcggcgcggg cggcagcggc 60 0 

ggaaagggcg gagacgcggg cgcgggcggc gccggtgcca ctccgggcgc caacggtatc 660 

gctggcaatg gcggcgacgg cggagatggc gcggctggtg ccgtcggcat ctccggcgca 720 

accggcgctg gcgacggcgg gcatggcgga accggcgcgg ccggcggcaa cggtggaacc 78 0 

ggcggtgctg gcggtagcgg catcgacggc gtcggcggcg ggaccggagg taccggcggc 840 

45 aacggcggca acggcgccat cggcggcgct ggcggagacg ccggtggtag cggaaatagc 900 

ggcggaaacg gtgggattgg cggaaagggc ggaaacgccg gtgccggtgg tgccgcgggc 960 

agcaacggcg gtaccgtcgg cgccaacggt accggcggcg acggcggcaa cggcggcgct 1020 

gccggggccg ccacggctgg cagcaacggt ggggccggca ccggctcggc cggcggcaac 10 80 

ggcggcaccg gcggcagagg cggcagtggt ggcgccggcg gcgacggtat cggtggcgtc 114 0 

50 ggcggcggca agggcggcaa cggcgcggac ggcgaagtcg gcggtgcggg cggcgccggc 1200 

ggcagcgggc ccaacaccag tcccggcggc aacggcgggc aaggaggtca aggcggcagc 12 60 

ggtggtgccg gtggggcggc cggggctggc ggcgccggtg gcggcgctaa cggcaccgct 13 2 0 

ggcaacggcg gccaaggcgg tgccggcggc accggcggcg ccggcgcagc ctcctcagct 13 80 

accaacggcg gcagcggcgg cgccggcggc accggaggcg acggcggcag cggcggcgcc 1440 

55 ggcggcaccg gaggcgccgg cggcaccggc ggggcggccg gcgacggcgg acaaggtggc 15 00 

cagggcggcg ccggcggcgg tgccggtggt caaggtggtg ccggcggtgc cggcgggacc 1560 

ggcggcaacg gcggcaatat caccggcggc accgcgggca ccgcgggggc cgccggtaac 1620 

ggcggcgccg ccggaaaggg tggcgccggc ggccaaggcg gcaccggtgg cgggaccggg 1680 

ggtcagggtg gcgccggcgg cgacggcggt gccggcggca ccggcggcga ccgcaccgtc 1740 

60 ggcggtggca cggtccccgc cggctccggt ggacaaggcg gtaacgctgg cggtggtggg 18 00 

gccggcgggc agggtggagc cgacggcggc agcggcggcg acggcggcga cgccggcaca 18 60 

ggtggcaatg gcggtaacgg cggcaaccgt aattccggca atggcaccgg cggcgctggc 1920 

ggcaacggtg gtggtggtgc taacggtggc gccggcggcg ctgggggcag cggcggcggc 1980 

accggcggca acggcggcgc tggcggcgac gccggcgacg ccggcaacgg cggcaacggc 2040 

65 aacggcaccg gcaacggcgg caacggcggc aacggcggca tcgccggcat gggcggcaac 2100 

ggcggtgccg ggacgggcag cggcaacggc ggcaacggcg gcagcggcgg caacggcggc 2160 

aacgccggca tgggcggcaa cagcggcacc ggcagcggcg acggcggtgc cggcgggaac 2220 
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ggcggcgcgg cgggcacggg cggcaccggc 
accggcggca gcggtggcac cggcggtgac 
accgcaaaca tgactgcgca ggcgggcggt 
ggcggcgggg ccggggccgg cggcggtggc 
5 caaggcggcg ccggcggcga tggcggcaac 
gacgaccccg gcggcaacgg gggcaccggc 
gcgggcatcg gcagccttgg cggcggcact 
ggtaccggcg gcgagggcgg cgaggtcggc 
aatggcggcg atggcggcac cggcggcacc 

10 ggcggcaccg gcggcaccgg cggcctcggc 
ggcggcaccg gcggcagcgg cggtgcggcc 
gcgggaggca atggcaacgg cggcaccggt 
aatggcggcg acgccgagcc cggagtgccc 
accaccggcg gcaagggtgg caccggcggc 

15 ggcggcgatg gcggcaccgg cggtggtggt 
aagggagaca ccggcagcgg cggtggcgcc 
accggcggcg ccggcggcga cggcggagcg 



ggcgacggcg gcctcaccgg tactggcggc 2280 

ggcggtaacg gcggcaacgg agcagataac 2340 

gacggtggca acggcggcga cggtggcttc 2400 

ttgaccgctg gcgccaacgg caccggcggg 2460 

ggggccatcg gcggccacgg cccactcact 252 0 

ggcaacggcg gcaccggcgg caccggcggc 2580 

ggcggcgatg gcggcaacgg cggcaacggc 2640 

ggcgccggcg gcaccggcgg tgcggccggc 2700 

ggcggcgggg acgggggcgc cggcggcacc 2760 

gacccccggg tcggcggatc cggcggcgac 2820 

ggcaatggcg gcaacggcgg caacgccggc 2880 

ggggccggcg gtatcggcgg caccggcggc 2940 

ccgggagccg gtggtgctgg cggcgccggc 3 000 

aacggcagtg gcaccggctc gggcggcacc 3060 

gggaacggcg gcaccggctg gaatggcggc 3120 

ggagacggtg gtaaggcacc agccggtggc 3180 

ggcggcaagg gcggcagcgg cggcgtctag 3240 



<212> Type : DNA 
20 <211> Length : 3240 

SequenceName : SEQ ID 529 
SequericeDe script ion : 



25 



Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atggtcatgt cgctgatggt ggcgccggag ctggtggcgg cggccgcggc ggacttgacc 60 

gggattgggc aggccatcag cgcggcgaat gcggcggcag cgggcccgac gacgcaggtg 120 

30 ttggcggccg ccggtgatga ggtgtcggcg gcgatcgcgg cgttgtttgg tacccacgcg 18 0 

caggagtacc aggcgttgag cgcccgggtg gcgacgtttc atgagcagtt tgtgcgctcg 240 

ctgaccgcgg ctggcagcgc gtatgcgact gccgaggcgg cgaatgcatc accgctgcag 3 00 

gcgctggagc agcaagtgtt gggtgcgatc aacgcgccca cacagctgtg gttggggcgc 360 

ccgctgatcg gtgatggcgt tcacggggcg ccggggaccg ggcagcccgg tggggccggg 42 0 

35 gggttgttgt ggggtaatgg cggtaacggc ggttcggggg cggccggtca agtcggtggg 480 

cccggcggcg cggccgggtt gttcggcaac ggcgggtccg gcgggtccgg cggggccggc 540 

gctgccggcg gtgtcggcgg atccggcggg tggttgaacg gcaacggcgg ggccggcggg 600 

gccggcggga ccggcgctaa cggtggtgcc ggcggcaacg cctggttgtt cggggccggc 660 

gggtccggcg gcgccggcac caatggtggc gtcggcgggt ccggcggatt tgtctacggc 720 

40 aacggcggcg ccggcgggat cggcggcatc ggaggtatag gcggcaacgg tggcgacgcc 780 

gggctgttcg ggaacggcgg cgccgggggg gccggggccg cgggcctgcc gggtgccgcc 84 0 

ggcctcaacg gcggcgacgg cagcgacggc ggcaacggcg gaaccggcgg caacggcggg 9 00 

cgcggcgggt tattggttgg caacggcggg gccggcgggg ccggcggcgt cggcggcgac 960 

ggtggtaagg gcggcgctgg cgatccgagt ttcgccgtca acaacggtgc cggcggtaac 1020 

45 ggcggtcacg gcggcaaccc cggcgtgggc ggggccggtg gggccggcgg cctgctggcg 1080 

ggtgcgcacg gtgccgccgg cgccaccccc accagcggcg gcaacggcgg cgatggcggc 1140 

atcggcgcca ccgccaactc acccctacaa gccggcgggg ccggcggtaa tggcggtcat 12 00 

ggcgggttgg tcggcaacgg cggcaccggc ggcgccggcg gtgccggtca tgcgggttcc 1260 

accggcgcta ccggtaccgc cttacaaccg acgggcggta acggcaccaa tggcggcgcc 13 2 0 

50 gggggccacg gcggtaatgg cggaaatggc ggcgcccagc acggcgacgg cggcgtcggc 13 80 

ggcaagggcg gtgccggcgg tagcggcggc gccggcggaa acggattcga cgccgccacc 1440 

ttgggttcgc ccggtgccga tggcggtatg ggcggcaacg gcggcaaggg cggtgacggc 1500 

ggcaaggccg gtgatggcgg agccggtgcc gccggtgatg tgaccttggc cgtcaaccag 1560 

ggtgccggcg gtgacggcgg caacggcggt gaagtgggcg ttggcggcaa gggtggggcc 1620 

55 ggcggtgtta gcgcgaaccc ggccctgaac ggttcggccg gggcgaacgg caccgcgccc 1680 

accagcggcg gcaacggtgg caacggaggt gccggcgcca cccccaccgt cgcgggagaa 1740 

aacggcggcg ccggtggtaa cggcggccat ggcgggtcgg tcggtaacgg cggtgcgggt 18 00 

ggtgccggcg gaaatggcgt cgccggcacc ggccttgccc tcaacggcgg caacggcggc 1860 

aacggcggca tcggcggcaa cggcggatcg gcggccggca cgggcgggga cggcggcaag 1920 

60 ggcggcaacg ggggcgccgg agccaacggc caagacttct ccgcgtccgc caatggcgcg 1980 

aatggcggac agggcggcaa cggcggcaac ggcggcatcg gcggcaaggg tggtgacgcc 2 040 

ttcgccacgt tcgctaaggc cggcaacggc ggtgccggcg gcaacggcgg caatgtgggc 2100 

gttgccggcc agggtggggc cggcggcaag ggcgccattc cagccatgaa gggtgcgacc 2160 

ggcgccgatg gcaccgcacc caccagcggc ggtgacggcg gcaacggcgg caacggcgcc 2220 

65 agccccaccg tcgcgggcgg caacggcggt gacggcggca agggcggcag cggcgggaat 2280 

gtcggcaatg gcggcaatgg cggggccggc ggcaacggcg cggccggcca agccggtacg 2340 

ccgggcccta ccagcggtga ttccggcacc tcgggcaccg acggtggggc tggcggcaac 2400 
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ggcggggcgg 
ggtaacggcg 
cccaatgcta 
ggcgccggcg 
5 gccacgggca 
ggggccggcg 
aaccccggta 
ggcaaggccg 

ggcggtaacg 

10 ggcgcggccg 
aacggcggca 
ggcggtaacg 
ggcgccggcg 
ggcggtgccg 

15 ggcggcgcgg 
ggtgccggcg 
ggcggcaacg 
ggcctcggcg 
ggctacggcg 

20 ggcggcaacg 
aacggcggcg 
ggcaatggcg 
aacggcggta 
aatgggggca 

25 gtcgccggcg 
ggccacggcg 
ggtgccgcca 
aatggcggca 
agcgccggca 

30 aacggcggca 
gccggcggca 

ggtggcagcg 
ggcgggggca 
tggcaacggc 
35 cggtgccggc 
cggcaagggc 
cgctggcgcc 



gcggcgccgg 
gccagggtgg 
acggcgcaaa 
gcaatggcgg 
ccggcggcga 
aaaacggcct 
ccggcggcaa 
gcaccggctc 
gcggcaatgg 
gcggcaacgg 
acggcggtgg 
gcggcaatgg 
gtggcggcgc 
gcggcttcgc 
gcgactttgc 
gggtcggcgg 
gcggtagcgg 
gcaacggcgg 
gtccgggagg 
gcggcgtcag 
acggcgggaa 
gtatcggcac 
gcagcaaaag 
ctggcctcaa 
tgtccttcgg 
gcgacggcac 
gcggctcagg 
acggcggcaa 
acggcggcca 
actccggcaa 
acggcggcgg 
gcaccacggg 
gcctgagcac 
ggcgctggct 
ggcaacggtg 
ggcgacggtg 
aacggcggtt 



cggaacactg 
catcggcggc 
cggcgagaac 
cgcgggcggc 
cggcggcaac 
aaacagcggg 
cggcggcaac 
cttgacgggc 
cggcaacggc 
cggcaacggc 
ggcgctgaac 
cggtgccggc 
cgttgacgtc 
tagcaccggg 
tagcggtgtt 
cttcggcggc 
cggcgacggc 
cgtctccgag 
ccccgaaggc 
caccacgggc 
cgtcggtttg 
cgacgcgggc 
cacgaccacc 
cggcgcgggc 
caacgctgtg 
gacgggcggc 
cgtcgtcaac 
cggctccgcg 
cggcggcggt 
cagcaccggc 
cacttccagc 
cggcgccggc 
aggccagtcc 
cggccggcaa 
gcggtggcga 
tcggccctgg 
ccagcaacgg 



gccggccacg 
gccggcgaga 
ggcggtagcg 
aaggcgcagg 
ggcggcgatg 
gccatgctgc 
ggcggcaacg 
ttggacggca 
ggcaagggcg 
ggctccggcc 
caggccggct 
atgatcaacg 
gccgcgacaa 
ttgggtggcc 
ggcggtgtcg 
cagggcggca 
ggtggcggca 
actgggtttg 
aatggcggcc 
ggcgacggcg 
ggcggtgacg 
ggtgccggag 
ggcaacgccg 
ggtgctggcg 
ggcggcgacg 
gccggcggca 
gtcaccgccg 
ggcgccggcg 
gccaccggcg 
gtcgcgggct 
gccgccggcc 
gcggccggcg 
ggcggcccac 
cggcggcgcc 
tgccggcaac 
ctccaccggc 
caacgctcgc 



gcggcaacgg 
gaggcgccga 
gtggtaacgg 
cggccgggta 
gcggcaaagc 
cgggcggcgg 
ccggcgtcgg 
ccgacggcat 
gcaccgccgg 
tcaacggcgg 
tcttcggcac 
gcggcctcgg 
cgggcggcgc 
caggcggcgc 
gcggcgccgg 
tcggcgggga 
tttccttagg 
gcggcgccgg 
tcggcggcaa 
gcgccggcgg 
ccggctccgg 
gggccggtgg 
gctccggtgg 
gggccggcgg 
gcggcaacgg 
agggcggcaa 
gccacggcgg 
gccagggcgg 
gcgacggcgg 
tggccggtgg 
acggcggcag 
gcaacggcgg 
ggcggcagcg 

ggtggtgccg 

gccggcfccag 

ggcgcgggcg 

ggtggcaacg 



tggtaagggt 
cggcgccggc 
tggcgacggc 
caccgacggc 
cggtgacggc 
caccgtagga 
cggcaccgga 
cacccccaac 
caacgggagc 
tgacgccggc 
gggcggcaaa 
cggcttcggc 
tggcggcaat 
cggcggtccc 
cggggacggc 
agggcgcaca 
cggcaacggc 
cggcaacggc 
cggcggcgcc 
caagggcggc 
cggcgcgggc 
cgctggcggt 
tgccggcggt 
caacgcgggt 
cggcaacggc 
cggcagcagc 
caacggcggc 
tgccggcggc 
caacggcggc 
tgccgccggc 
cggcggcagc 
cgccggtgct 
gtggtgccgg 
gtggcaacgg 
gcggcaatgg 
gcaagggcgg 
ccggtaa 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
.3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4617 



40 



45 



50 



55 



60 



65 



<212> Type t DNA 

<211> Length r 4617 

SequenceKTame : SEQ ID 530 
SequenceDescription : 

Sequence 



<213> Organi stnName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgtccttcg tggtcacagc accgccggtg ctcgcgtcgg cggcgtcgga tctgggcggt 60 

atcgcgtcca tgatcagcga ggccaacgcg atggcagcgg tccgaacgac ggcgttggcg 120 

cccgccgccg ccgacgaggt ttcggcggcg atcgcggcgc tgttttccag ctacgcgcgg 180 

gactatcaaa cgctgagcgt ccaggtgacg gccttccacg tgcagttcgc gcagacattg 240 

accaatgcgg ggcagctgta tgcggtcgtc gacgtcggca atggcgtgct gttgaagacc 3 00 

gagcagcagg tgctgggtgt gatcaatgcg cccacccaga cgttggtggg tcgtccgctg 3 60 

atcggcgatg gcacccacgg ggcgccgggg accgggcaga acggtggggc gggcggaatc 420 

ttgtggggca acggcggtaa cggcgggtcc ggggctcccg gacagccggg cggccggggc 480 

ggtgatgccg gcctgttcgg ccacggcggt catggcggtg tcggggggcc gggcatcgcc 540 

ggtgccgctg gcaccgcggg cctgcccggg ggcaacggcg ccaacggcgg aagcggcggc 600 

atcggcggcg ccggcggcgc cggcggcaac ggcgggctgc tattcggcaa cggtggtgcc 660 

ggcggccagg gtggctccgg cggacttggg ggctccggcg ggacgggcgg cgcgggcatg 720 

gctgccggtc ccgccggcgg caccggcggc atcgggggca tcggcggcat cggcggcgcg 780 

ggcggggtcg gcggccacgg ctcggcgttg ttcggccacg ggggaatcaa cggcgatggc 840 

ggtaccggcg gcatgggtgg ccagggcggt gctggcggca acggctgggc cgctgagggc 900 

atcacggtcg gcattggtga gcaaggcggc cagggcggcg acgggggagc cggcggcgcc 960 

ggcgggatcg gtggttcggc gggtgggatc ggcggcagcc agggtgcggg tgggcacggc 1020 

ggcgacggcg gccagggcgg cgccggcggt agtggcggcg ttggcggcgg cggcgcaggc 1080 

gccggcggcg acggcggcgc gggcggcatc ggcggcactg gcggtaacgg cagcatcggc 1140 

gggg cc g cc g gcaatggcgg taacggcggc cgcggcggcg ccggtggcat ggccaccgcg 1200 
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ggaagtgatg gcggcaatgg cggcggcggc ggcaacggcg gcgtcggtgt tggcagcgcc 1260 

ggaggggccg gcggcaccgg cggtgacggc ggggcggccg gggcgggcgg cgcgccgggc 132 0 

cacggctact tccaacagcc cgcgccccaa gggctgccca tcggaaccgg cgggaccggc 13 80 

ggcgaaggcg gtgccggcgg cgccggtgga gacggcgggc agggcgacat cggcttcgat 1440 

5 ggcggccggg gtggcgacgg cggcccgggc ggtggcggcg gcgccggcgg tgacggcagc 1500 

ggcaccttca atgcccaagc caacaacggc ggcgacggtg gtgccggcgg tgttggggga 1560 

gccggcggca ccggcggcac gggtggggtc ggggccgacg ggggtcgcgg gggggactcg 1620 

ggccgcggcg gcgacggcgg caacgccggc cacggcggcg ccgcccaatt ctccggtcgc 1680 

ggcgcctacg gcggtgaagg tggcagcggc ggcgccggcg gcaacgccgg tggcgccggc 1740 

10 accggtggca ccgcgggctc cggcggtgcc ggaggtttcg gcggcaacgg tgccgatggc 18 00 

ggcaatggcg gcaacggtgg caacggcggc ttcggcggaa ttaacggcac gttcggcacc 1860 

aacggtgccg gcggcaccgg cgggctcggc accctgctcg gcggccacaa cggcaacatc 1920 

ggcctcaacg gggccaccgg cggcatcggc agcaccacgt tgaccaacgc gaccgtaccg 1980 

ctgcagctgg tgaataccac cgagccggtg gtattcatct ccttaaacgg- cggccaaatg ^2040 

15 gtgcccgtgc tgctcgacac cggatccacc ggtctggtca tggacagcca attcctgacg 2100 

cagaacttcg gccccgtcat cgggacgggc accgccggtt acgccggcgg gctgacctac 2160 

aactacaaca cctactcaac gacggtggat ttcggcaatg gccttctcac cctgccgacc 2220 

agcgttaacg tcgtcacctc gtcatcaccg ggaaccctgg gcaacttctt gtcgagatcc 2280 

ggtgcggtgg gcgtcttggg aatcgggccc aacaacgggt tcccgggcac cagctccatc 2340 

20 gttaccgcga tgcccggcct gctcaacaac ggtgtgctca tcgacgaatc ggcgggcatc 2400 

ctgcagttcg gtcccaacac attaaccggc ggtatcacga tttctggagc accgatttcc 2460 

accgtggctg ttcagatcga caacgggccg ctgcaacaag ctccggtgat gttcgactcc 2520 

ggcggcatca acggaaccat cccgtcagcc ctcgccagcc tgccgtccgg gggattcgtg 2580 

ccggcgggaa cgaccatttc ggtctacacc agcgacggcc agacgctgtt gtactcctac 2640 

25 accaccaccg cgacaaacac cccatttgtc acctccggcg gcgtgatgaa caccgggcac 2700 

gtccccttcg cgcagcaacc gatatacgtc tcctacagcc ccaccgccat cgggacgacc 2760 

acctttaact ga 2772 
<212> Type : DNA 
<211> Length : 2772 

30 SequenceName : SEQ ID 531 

SequenceDescription : 



Sequence 

35 <213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

ttgatcggca acggcggggc cggcgggtcc ggggcgcccg gcgccatcgg tggggccggc 60 

gggcccgcgg ggttgatcgg tgtcggaggt gccggcgggg ccggtggaga ctccgcggtc 120 

gcgggtgtca tcggaggggc cggtggggca ggcggggctg ccctgctgtt cggtgccggt 180 

40 ggggccggcg gggccggggg ttccggcggt tccggcgcag ctggtggggc cggtggcgcc 240 

ggtggggccg gcgggctgtt cgccagcggc ggcagcggcg ggttcggcgg gttcgcatcg 3 00 

acgggcaccg gtggggccgg cggcaccggt ggggctggtg ggttgttcgc cagcggcggg 3 60 

gtcggcggta ctggcggggg agccgggtcc ggcggtaccg gtggggttgg tgggacgggt 420 

ggggccggag ggctgttcgc tagcggcggc gctggcgggg ccggcgggtc cggcggtacc 480 

45 ggtggggctg gtgggacggg tggggccggc gggctgttcg gagccggtgg cgctggcggg 540 

ctcggcgggc aaggcaacca caccggcggg cacggtgggg ccggtggcag cgccggcctg 600 

ctcgcccttg gcgacggcgg cgctggcggg gccggcgggg ccgctaccac cggaaccggc 660 

ggggccggcg gggcgggtgg caaggccggc ctgctgttcg gctccggtgg ggccggtggg 720 

tccggtgggg ctgccggcac cttcggtgac accggtaact ccggcggggc cggtggggcg 780 

50 ggtggcaagg ccggcctgct gttcggctcc ggtggggccg gtgggtccgg cggcgctggg 840 

ggcttcgcca acggctctac cggcggtgcc ggcggggccg gcggcggggc cgggctgatc 900 

ggcaacggcg gcaacggtgg cagcggcggc acgtcggttg ccaccggggg ggccgggaac 960 

ggcggtgccg gcggcgccgg cggcggggcc gggctgatcg gcaacggcgg caacggcggc 1020 

agtggcggaa tgggcgatgc cccgggcggc accggcgtcg gcggcatcgg tgggctgttg 108 0 

55 ttgggtttgg acggcgccaa cgccccggcc agcaccaacc cgctgcacac cgcgcagcag 1140 

caggcgttgg ccgcagtcaa cgcgcccatc caggccgtga ccgggcgccc gctgatcggc 12 0 0 

aacggcgcca acggcgcccc gggcagcggg gcccccggcg ggcacggcgg gtggttgttc 1260 

ggcggcggag ggaccggcgg gtccggcgtc agcggcgggg cgggcggaga tggcggggcc 132 0 

ggcgggatct tgttcggcgc cggcggggcc ggcggcgcgg gcggggccgt cacgggaacc 1380 

60 ggcgccaccg gcgggtccgg tggggccggc ggtggagcct tgctgtttgg ggccggtggg 1440 

gccggtggag ccggcgggtc cagcgggatt ggcgggttcg ccgcgggcgg ggccggtggg 1500 

cccggagggg ccggtgggct gttcaacggc ggcggggccg gcggggccgg cgggtccggc 1560 

gtcagcggcg gggctggcgg ggagggcggg gccggcgggg ccggtggcct gttcgccggt 1620 

ggcggggccg gcggggccgg cggatcgggc aacaacgtcg ggggggccgg cggggccggt 1680 

65 ggggtcggtg ggctgttcgg ggccggcggg gccggcggat ccggcggcgg cggtagcgtt 1740 

gctggcgaca gtggggccgg cggcaacgcg ggcttgctcg cccccggtct cgccggcggt 1800 

gccggcggtg gcggcgggca gggttttgac accggcgggg ccggcgggcc cggcggcgac 1860 
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gccggcctgc tggtcggctc cggcggggtc ggaggtgccg gcggattcgg cctcactacg 1920 

ggtgggcctg gggcggccgg cggcgacgcc ggcctgctgt tcggctccgg cggcgctggc 198 0 

ggggccggcg gctccggccg aaccgacctc ggcggcgctg gcggagccgg cggcaaggcc 2 040 

gggctgatcg gcaacggcgg taacggcggg gccggcgggg ccggcgggaa cggcggcggg 2100 

5 gacggcgggc ccggtggagc cgccttcggg ctcggtaacg gcggcaacgg cggcaacggg 2160 

gggaccggca cgtccgcggg cagccccggt gccggcggcg ccggtggttc gctgatcggc 2220 

gcggaggggc tgcccgggct gctgccctag 2250 
<212> Type : DNA 
<211> Length : 2250 
10 SequenceName : SEQ ID 532 
SequenceDescription : 

Sequence 

15 <213> OrganisraName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgtcgtttg tgattgctgc gccggaggcg ttggtcgcgg tcgcttcgga tctggcgggc 60 

attgggtcgg cgctggcgga ggccaacgcc gcggcgttgg ccccgacgac ggcgttgttg 120 

gccgcgggtg ccgatgaggt gtcggcggcg atcgcggcgc tgtttggcgc gcacgggcag 180 

20 gcgtatcaga cggttagcgc ccaggcgtcg gcgtttcatg cccagtttgt gcaggcgttg 240 

actggcggcg gcggggcgta tgcggctgcc gaggccgcca acgtctcggc ggcgcagagc 300 

accgaccagc ggctgctcga tctgatcaat gggcccaccc aggcgttgtt ggggcgtcca 360 

ctgatcggtg atggcgccaa cggcgggccg gggcaagacg gcgggcccgg ggggttgctg 42 0 

tacggcaacg gcggcaacgg cggcactagt accaccgccg gggtggccgg cggcaacggt 480 

25 ggcgccgccg ggctgatcgg caacggcggg gccgggggcg gcggcggggc cggcgcggcc 540 

ggcggcaatg gcggtgcggg cgggtggctg tatggcaacg gcggcgccgg cggggccggt 600 

gggacatcgg tgatacccgg tgtcgccggc ggcaatggcg gggctggcgg gtccgcggga 660 

ctgtggggta ccggcggggc cggtggcgac ggcggcaacg gccggtcggg gccagtcaac 720 

gtcgccggca gcgcgggcgg caacggtggc gctggtggcg ccgccgggtt attcggtgac 780 

30 gccggggccg gtggcaacgg cggcaagggc ggtgctggcg gcgccgcctt tagcattaac 840 

ttcaccgcag gcgatggcgg tgcgggaggt gccggtgggt ccggcggcca cgcattgctg 900 

tggggcgccg gcggagccgg gggtaacggc ggatccggcg gcacgggggg tgccggcggc 960 

agcaccgctg gcgctggcgg caacggcggg gccgggggtg gcggcggaac cggtgggttg 102 0 

ctcttcggca acggcggtgc cggcgggcac ggcgccgccg ccggaaacgg cttagccgcg 10 80 

35 ggtaafcggcg tcagcagcag cggcggcggc ggtgccggtg ggaccggcgg ggccggtggg 1140 

gacggtggcg ccggcggggc cggaggcaac gccag'gctgt ggggcgtcgg tggcgccggc 1200 

ggggccggcg gggacggtgg cgccggcggg gccggcggca aaggcggctc tggcctcagc 1260 

ggtaacgcca acggcggggc cggcggcgac agcggccgtg gcggcacggg cggcgccggc 132 0 

ggcgagggcg gcgccgccgg gctgctggtg ggcaccggcg ggcacggcgg tgacggcggg 1380 

40 gccggcggcg ccgccgtcaa gggcggtgac ggcggggccg ccgccggcac gggcatcgcc 1440 

ggcgctggcg gccgtggcgg cgcgggcggc agcggtggca gcggtggtga cggcgggggc 1500 

ggggccgccg gccccgccgg gtggctgttc ggcgatggcg gggctggcgg gaacggcggg 1560 

gccgcggccg ccggcggcgc cggcggccaa gccggcggtg gcggcgggaa cggcggcaat 1620 

ggcggcaacg gcggcaatgg cggcaatggc ggcaacggcg ccaccggggg gtggctgtac 1680 

45 ggcaacggcg gggccggcgg ccagggcgcc accgccggag ccggcggagc cggcgctaac 1740 

ggcgtcagca gcaccaatgg cggcggcacc ggcggcaacg gggggatcgg cgggaccggt 1800 

gggtccggcg gggccggtgg caacgccggg ctgttgggcg tgggcggcgc cggcgggcac 18 60 

ggcgcctccg gcggcgccgg cgataggggc ggcgctggcg gtaccgggtt cataagcagt 1920 

gacggcggtg ctggcggtga tggcggtgat ggcggcaacg gcggggccgg cggcaccggt 1980 

50 gggctgttgt tcggtgccgg cggcaatggt ggccccggcg ggtctggcgg tgccgccgat 2040 

attggcggca acggcggcgc cggtaacggc gggggcaccg acgggaacgg cggtaatggc 2100 

gggtccggcg gcggcgccgg cagcggcggt gacggcggcg gggctggcgg caacggtgcg 2160 

tggctgttcg gcaatggcgg cgccggcggg ggcggcggaa aaggcggcaa cggtgccggc 2220 

ggcgggcttg gcggcggttc attcggcctc cccggcctga acggcagcgg cggcgacggt 2280 

55 ggcgacggcg gtaacggtgc ccccggcggg gtgctgtatg gcaatggcgg cgccggcggc 2340 

caggggtcaa gcggtggcat cggcggcccc ggcgccaccg gcggtgccgg cggcaaaggc 2400 

ggtgatggtg gcgatgcgca gctgatcggc gacggcggca atgggggcaa cggaggcgcg 2460 

ggcggcaccg ggggcacccc ggggcccggc ggacccggcg ggtccggcgg gcttggaggc 2520 

ctgctgttcg gccaaaccgg cacggctggc gtgtcgccgt ag 2562 

60 <212> Type : DNA 

<211> Length : 2562 

SequenceName : SEQ ID 533 
SequenceDescription : 

65 Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
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<400> Pre Sequences t ring : 

atgtcctatc tcgtcgtggt gccggagttg gtcgcagcgg cggcaacaga tttggcgaac 60 

atcggttcgt cgattagtgc agccaacgcg gccgcggcgg caccgaccac ggcactggtc 120 

gcagccggcg gcgacgaggt atcggcggcc atagccgcgt tgttcggagc gcatgctcgg 180 

5 gcatatcaag cgttgagtgc ccaggcggcg atgtttcatg aacagtttgt ccgggccctc 240 

gccgccggcg gtaactccta cgccgtcgct gaggcggcaa ccgcgcaatc ggttcagcaa 300 

gatctgctca acctgatcaa tgcgcccacc caggcgctgt tggggcgtcc gctgatcggc 360 

aacggcgcca acgggctgcc gggtacgggc cagaacggcg gcgacggcgg gattctgtac 42 0 

ggcaacggcg gcaacggtgg gtccggcggg gtcaaccagg ccggtggcaa tggcgggaat 480 

10 gctgggctgt ggggcaatgg cggatccggc ggagccggcg ggaacgccac cactgccggc 540 

cgcaacggct tcaacggggg cgccggggga agcggcggtt tgctgtgggg caatggcggt 600 

gccggcgggg ccggtgggaa cggcggtccg gctccgctcg tgggcggggt gggcaccacc 660 

ggtggcgccg gcgggaacgg cggcggcgcc gggttgttct acggtttcgg cggcgccggt 72 0 

ggoaaqggcg ggatgggcgg ggtggcaccg agcaccggcc cctcgatggg ,catcctcccg 78 0 

15 gccggcggtg tcggcgggcc tggtggctcc ggcggggcga gcgcgcttgc cttcggctcc 840 

ggcggcgtcg gcggtgccgg tggcttgggc gggccgaccg atggcaccgt ccagggggtg 90 0 

ggcggcttcg gcggtcaggg cggcaacggc gggcagagcg gcttgttgtt tggcaacgcg 960 

ggagccggcg gggcaggcgc tgccggcgga gccggcaccg gcgacaccga gagcttcggc 102 0 

ggccacggcg gggccggcgg tgatggcggc gctgttggct tgatcggtaa cggcggggcc 1080 

20 ggcggcaccg gatctcccgg cgctgtggtg ggtggtaacg gcggcgtcgg tggtctgggt 1140 

ggcgccggca gtcccggggg tctgttgtac ggcaccgggg gggccggcgg caatggcgga 12 0 0 

ccgggtggtg acggtggtac tggcgcgacg gtgggctttg ccggctccgg cggtttcggc 1260 

ggtgcggggg gcatcgccca gctgtttggc acgggtggca tgggtggtag cggcggtggt 132 0 

ataggcgctg gcaccacgac cgtggtgccg cccgacgtcg ccccggtggg tggcacaggc 138 0 

25 ggcaatggcg gtcgcgccgg gctgctgttg ggtgtgggtg gcatgggcgg taatggcggt 1440 

gccaccagcg tcggcgggac gctctacgcc gccggtggaa acggcggcga cggcgggttg 1500 

gtgtggggca acggtggcac cggcgggagc ggtggcgccg gcggggcggg cagcgtcggc 1560 

aacggcggtg cgggtggcaa cgcggcactg ctgttcggca acggcggggc gggcggggcc 1620 

ggcggcgccg gcggcatcgg tgccggcgga gccggcggct tcggcgcggt tctgtttggc 1680 

30 aacggcgggg ctggcgggag cggtgccccc ggtggcatcg gcgccggtgg caatggcgga 1740 

aacgcgctgc tggtcggcaa cggcggcaac ggtggggcag gtaccggtgg ggctgctggc 18 0 0 

ggtgccggtg gctcgggcgg gttgctattc ggccaaaatg ggatgcccgg gccgtga 1857 

<212> Type : DMA. 
35 <21X> Length : 1857 

SequenceName : SEQ ID 534 
SequenceDescription : 

Sequence 

40 

<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgaattttt ctgtactgcc gccggagatc aattcagcgc tgatattcgc cggggcaggg 60 

ccggaaccga tggcggcggc cgcgacggcc tgggacgggt tggccatgga attggcctcg 120 

45 gccgcagcct ctttcggctc agtgacatcc ggactcgtgg gcggggcgtg gcagggcgcg 180 

tcgtcgtcgg cgatggcggc agcggcagcc ccctatgcgg cgtggcttgc cgcggcggcg 240 

gtccaggccg agcagacggc cgctcaggct gcggcgatga tagccgagtt tgaagcggtc 300 

aagacggcgg tggtgcagcc gatgctggtg gcggccaacc gtgccgacct ggtgtcgctg 3 60 

gtgatgtcga acctgtttgg acagaacgct ccggcgatcg ctgccattga agccacgtac 420 

50 gagcaaatgt gggctgccga tgtgtcggcg atgtctgcct accatgccgg ggcatcggcg 480 

atcgcctcgg cgctgtcccc gttcagtaaa ccgctgcaga acctggctgg cttgccggct 540 

tggttggcca gcggcgcgcc tgcggccgcc atgaccgcag ccgcaggcat accggcgctt 600 

gcgggcggac ccaccgccat caacctgggc atagccaacg tcggcggtgg caacgtcggc 660 

aacgccaaca acggccttgc caacatcggc aacgccaacc ttggcaacta caatttcggg 720 

55 tccggaaatt tcggtaactc caatatcggc tcagcaagcc tgggtaataa caacatcggc 78 0 

ttcgggaacc tcggcagcaa caatgtcggc gtgggaaacc ttggcaatct caacaccggg 840 

tttgccaaca ccggcttggg caacttcggc tttggcaaca ctggcaacaa caacatcggc 900 

atcggtctta ccggcaacaa ccagatcgga atcggcgggc tcaactcggg caccgggaat 960 

ttcggattgt tcaactcggg cagcggaaac gtcggcttct tcaactccgg caatggaaac 1020 

60 tttggcatcg gaaactcggg taatttcaac accggtggct ggaattctgg acacgggaac 1080 

acgggcttct tcaatgcggg ctcgtttaac accggtatgt tggacgtcgg caacgcgaac 1140 

acaggcagcc tgaacaccgg cagttataac atgggcgact tcaatccggg gtcgtccaac 1200 

accggcacgt tcaacacggg aaatgctaac acgggtttcc tcaacgccgg aaatatcaac 12 60 

actggtgtct tcaatattgg ccacatgaat aatgggctgt tcaacacggg tgacatgaac 1320 

65 aatggcgtct tctaccgggg cgtggggcag ggcagcctgc agttcagtat tacgacacct 1380 

gatctgactc tgccgccgct gcaaataccg gggatatcgg ttcccgcctt cagtctgccg 1440 

gcaataacgc tgccgtcgct gaacatcccg gccgccacca caccggccaa catcaccgtc 1500 



4'-' 
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ggcgccttca gcctgcccgg gttgacgttg ccgtcgttga acatcccggc 

ccagccaaca tcaccgtggg tgccttcagc ctgcccgggt tgacgttgcc 

atcccggccg ccaccacacc agccaacatc accgtcggcg ccttcagcct 

acgttgccgt cgttgaacat cccggccgcc accacaccag ccaacatcac 

ttcagcctgc ccgggttgac gttgccgtcg ttgaacatcc cggccgccac 

aacatcaccg tcggcgcctt cagcctgccc gggttgacgt tgccgtcgtt 

gccgccacca cacccgccaa catcaccgta agcggctttc agttgcctcc 

ccttccgtag ccattccgcc ggtgacggtc ccgcccatta cggtgggtgc 

ccgccattgc agattccgga agtaactatt ccgcagctga cgatacccgc 

atcggtggct fctagtctacc tgcgatacat actcaaccga taacggtcgg 

gtgggccaat ttggcctgcc ctccataggc tgggatgttt tcctaagcac 

acagtaccgg cttttggaat accctttacc ctacaattcc agaccaatgt 

cagccgcccg gcggcgggct tagtactttc accaatggcg ccctcatctt 

gacttaccac aattggtggt tcacccatac acattgaccg gccctattgt 

ttctttctgc ccgccttcaa catacccggg atcgatgtcc ccgctatcaa 

ttcaccctgc cgcagatcac caccccagct atcaccaccc cggagttcgc 

atcggcgtgg gcggcttcac tctgccgcag atcaccaccc aggaaatcat 

ctaaccatca actcgatcgg cgtcggcggg ttcaccctgc cgcaaatcac 

atcaccaccc caccgctgac catcgacccc atcaacctca ccggcttcac 

atcaccaccc cacccatcac caccccaccg ctgaccatcg accccatcaa 

ttcaccctcc cccaaatcac caccccaccc atcaccaccc caccgctcac 

atcggcgtgg ggggcttcac cacgcccccg ctcaccgttc ccggcatcca 

accacgatcg gggccttcgc gatccccggg gggccgggct acttcaactc 

ccttcgtcgg gcttcttcaa ttccggtgcg ggcggcaact cgggcttcgg 

tcgggcctct cgggttggtt caacaccaac ccggccgggc tgttgggcgg 

cagaacttcg gcgggctatc ctcgggcttt tccaaccttg gcagcggcgt 

gccaacaggg gcatcctgcc gttctcggta gccagcgtcg tttccggctt 
tggcgggttt cttccaaggc accacgtcct aa 
: DNA 
<211> Length : 3162 

SequenceName : SEQ ID 53 5 
SequemceDescription : 

Sequence 



ggcaccaacc 
<212> Type 



cgccaccaca 
gtcgttgaac 
gcccgggttg 
cgtcggcgcc 
cacaccagcc 
gaacatcccg 
gctgagtatt 
ttttaatttg 
gggtatcaca 
ccagattggc 
acctaggata 
gcctgcgctt 
cggtgagttt 
catcggttca 
cgtcgatggc 
gatccctccg 
caccccggag 
caccccaccc 
cctcccccaa 
cctcaccggc 
catcgagccg 
cctgcccagc 
gagcaccgcg 
caacaacggc 
ctcgggctat 
ctcaggcttc 
tgccaatatc 



<213> Organd-smName r Mycobacterium tuberculosis H3 7Rv 
<400> PreSecjuenceString : 

atgttgtatg tagttgcgtc acccgacttg atgaccgcgg cggctaccaa 
attggttcgg cgatcagcac ggcaaatggt gcggcggcac tcccgactgt 
gccgcggccg crcgacgaggt gtccacgcag atcgcggctc tattcggagc 
agctaccaaa ccctcagcac ccaggcagcg gcgtttcata gtcggtttgt 
accacggccg cggcttccta cgccagcgta gaggccgcca acgcgtcgcc 
gcgctagacg tgattaatgc gcccgcccag acactgctcg gacgtccgct 
ggcgccgacg gatcgacacc ggggcaggcc ggcgggcccg gcgggttgct 
ggcggtaatg cgcgccgccgg tgggcccaac caggccggcg gcgccggcgg 
ttgatcggca acggcggggc gggcggcgcc gggggtgttg gcgcggtcgg 
ggcacgggcg cgcctgctatt cggcaacggc ggggccggcg ggcaaggcgg 
gcaggtatca acggcggcag cggcgggcag ggaggccacg gtggcaacgc 
ggccagggcg gtgccggcgg gccaggtggc accggcgcca tgggcgtcgc 
cccaccccca tcggcaccgc agcgcctggc agcgacggcg taaatcagat 
ggtaacacgg acctcaccgg cggcgccggt ggcgacggca atgccggcag 
aacggcggca acggcggtac cggcggcgca gctaggaact catctggtgg 
tcctttggtg gtgccggcgg cgccggaggc gacggcgcca acggcggcga 
ggcggggaag ccctcaccga aggcggtgcc accgccgtta gtggtgctgg 
ggtaacgccg aggcttccgg cggcgccggc ggcaacggcg gcaaaggtgg 
gccaccacca gcgtgaccgg gggtaacggc ggtaacggtg gcaatggcca 
gcgccgggcg gcgctggcgg cagcggtggc 
ctggccggca acggcggcac cggcggtgcc 
gcccccggcg cgtgccggcgg cgccggcggc 
aatgccaccg taaccggggg caatggcggg 
accggggggg ctgggggtgc cggaggtcta ggtggtcacg ggggtgcagg 
attggcaacg gcggcgccgg tggcgctggc ggcctcggcg gtgcgggcgg 
gcgggcggtg agggcggtgc cggcggcgcc ggaggcgaag ctattcccgg 
accaactccg ccggcggtga cggaggggcg ggcggtactg gcggcaatgg 
ggtgccggcg gagcccccgg cctcggtggc gcgggcgggg ccggcggatg 
cagtcgggca gcaccggcgg cggtggcgcc ggcggtgccg gtggtgccgg 
ggcgcgggcg gcagcggcgg tgcgggtggc catggcgaca ctacctccgg 



gtcggcggtg acggcggccg 
ggtggcaacg gcggtaccgg 
aaagccgaca tcgccaacag 
acaggcggag acggcggcag 



tctggcggag 
tgaggtggtg 
gcatgccagg 
gcaggcgttg 
acttcaggtt 
aattggtaac 
gtacggcaac 
caacgccggc 
cggtaaacgc 
gctcggcctc 
catcctgttc 
cggcaccaat 
tgggaacggt 
caccaccgtg 
taccggtaac 

cggtggcgct 
tggtaaggga 
ctttgctcag 
cgacagtaac 
tggcggcctg 
tggcgccggt 
cctcggcgac 
cgcgctgggc 
cgggctgctg 
cgccggcggt 
cggggcgtcc 
cggtgacggc 
gttgatcggc 
aggtgccggt 
caagaacggt 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
-.2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3162 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
•960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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tcgtctggca ccgcgggctt cgacggcaac cccgggcagc ccggctga 1848 
<212> Type : DNA 
<211> Length : 1848 

SequenceName : SEQ ID 536 
5 SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
10 <400> PreSequenceString : 

atgcattact cagtgttgcc gccggagatc aactcggcct tgatcttcgc cggggcgggc 60 

tccgga.ccga tgctggcggc ggcgtcggcc tgggacgggc tggcaaccga attagcctcg 12 0 

gctgctgtct ctttcggctc ggtgacagcc gggctggtcg gcgggtcgtg gcagggtcgg 180 
tcatcggtgg cgatggcagc ggcggcagcc ccgtatgcgg ggtggctggc cgcggcggcg - -.240 

15 acccaggccg agcaggcggc cacccaggcc caggtgatgg tggccgagtt cgaggctgtg 3 00 

cggctggcga tggtacaacc ggcgctggtg gccgccaacc gttccggcct catatcgctg 3 60 

gtgatatcga acctttttgg tcaaaacgct cccgcgatcg cggccgccga agccgcatac 420 

gaggacjatgt gggctctgga tgtatcggcg atggcggcct accattccgg ggcgtcggcg 48 0 

gtcgctgtgg cgctaccggc attcgccctc ccgctgcggc ttccggcggg tctggcggcc 540 

20 gggcccgcgg ccgtggtgac cgcgctcacc acggccgtgg gcatgccgac ttttgccggc 600 

cgggccjatcg ccgctagcct eggcttggcc aacgtcggtg gtggcaacct cggcaatgcc 660 

aacaafcgggc tcggcaacat cggcaacgcc aaccttggca acaacaatct ggggtccggc 72 0 

aacttcggta gcttcaatat cggctcggcc aacctaggtg gcaacaacat cggcatagga 780 

aacgcgggcg ccaacaactt cggacttgca aacctgggca atttgaacac gggattcgcc 840 

25 aatgcaggca tcggcaactt cggaattgcg aacaccggca acaacaatat cggcaacggc 90 0 

ctgacfcggaa acaaccaaat cggcattggc ggactcaatt ccggcaacgg taacgtcgga 960 

ttattcaacg cgggtagcgc caatatcggt ttcttcaact ccggcaatgg caactttggc 1020 

atcggcjaact ccggtaacfct cagcactggc ctgttcaacc ccggacacgg caacaccgga 10 8 0 

ttcctgaatg cgggctcttt caatacgggc atgttcgacg ttgggaacgc gaacaccggc 1140 

30 agcttcraacg tcggccacta caacttcggt gccttcaacc cgggcccgtc gaacacgggt 12 00 

accttcraaca cgggcggcgc caacaccggc tggttcaaca caggaagcat caacaccggc 1260 

gccttcaaca taggcgacat gaataacggc ttgttcaaca cgggcgacat gaacaatggt 1320 

gtctttitacc gtggtgtggg ccaaggcagc ctgcagttcg ccatcaccag ccctgatttg 13 80 

acgctfcccgt ctctggaaat acccggaatc tcggttcccg cgttcagcct gcccgcgata 1440 

35 accttgccgt cgtfcgacgat tccggcggtg acgacgccgg ccaacgttac cgtgggtgcg 1500 

tttgatttgc cggggttgac ggtgccgtcg ttgacgattc cagcggcgat gacgccagct 1560 

aacatcracgg tgggtgcgtt tgatttgccg gggttgacgg tgccgtcgtt gacgattcca 1620 

• gctacaacga caccagccaa catcacggta ggtgcgttta acttgcctca gttgagtatt 1680 

ccgtccjgtga cggtfcccgcc gatcacgatt ccggctggca cagcgctagg tgcgttcaat 1740 

40 ctgccg-acgc tgagtattcc gtcggtgacg gttccgccga tcacgattcc ggctggcacc 1800 

actgtcggcg gatttacgct acccacgata cacaccccgt taataagtac accccaaata 18 60 

agtataggcg gctttagcac tcccggcata gccacgcaag caaattctgg tgtcatcaat 1920 

cttcccacct ttagccttaa cggcattacg ataactaatt tggtggtgtt cattccgaac 1980 

aacatcactg ccttgcaaac caatatgccc ggggtattcc cgcagattgg cggcttcgct 2040 

45 aatacacctc ctgcctttat taatactggg accattaccg tgggtggagg tcaaatcaac 2100 

ggcgtcggct tctcgatcgg cgcaatcaac gtcaccccct tcaccctccc caacgtcgtc 2160 

atccaaccgt ggtccctcgg ggggatctcg gtcgacgggt tcaccctgcc agagatcagc 2220 

acccaagaat tcaccactcc ggcgttgacg atcagtccga ttggtgtcgg tgcattgagc 22 80 

ctgccggata tcactactca acagttcacg accccggagt tgaccatcga cccgatcacg 2340 

50 ctgggtgggt ttacgctgcc gcagctcagc atcccggcga ttaccacccc ggcgttcacg 2400 

atcgafcccga tagcgctggg tggtttcacg cttcctcaga tcatgacgcc cgagataacg 24 60 

actccaccgt tcgccatcga cccgatcgga cttagcggtt tcaccctccc ccaggtcaat 252 0 

atcccggaga tcaccacgcc agagttcacc atccagccgg tgggcttggc ggccttcacc 2580 

acacccgcac tcaccatcgc cagcatccac ctgccgagca ccaccatggg cggattcgca 2640 

55 atcccagcgg ggccgggata cttcaactcg agcgcaacgc cctcgttggg ctttttcaac 2700 

gccggaatcg gtgggaactc gggcttcggc aacagcggct cgggactgtc gggttggttc 2760 

aacacaagtc ctgttgggct gctagccggc tcgggctacc agaactacgg tggtcttatc 2820 

tccggcttct ccaaccttgg cagcggcata tcgggcttcg ccaacaccgg caccctgccg 28 80 

tttgccgtga ccagcttggt ctccggtttg gccaacatcg gcaacaacct gtcgggcctg 294 0 

60 ttcttccaga gcaccacgcc ataa 2964 
<212> Type : DNA 
<211> Length : 2964 

SequenceName : SEQ ID 537 
SequenceDescription : 

65 



Sequence 
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<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<40 0> PreSequenceString : 

atgtcgtttg tagtcgtggc gccggaggtg ttggcggcgg ccgcttcgga tctagcgggc 60 

atcgggtcga cactggcgca ggccaacgcc gcggcgttgg cgccgaccac cgcggtgttg 12 0 

5 gccgcgggtg ctgatga^gt ttccgcggca atcgcgtcgc tgtttggggc gcatggtcag 18 0 

gcgtatcagg cggtgagcgc ccaaatgtcg gcgtttcacg cccagttcat gcaggcgttg 240 

acgggtgccg gcggggctta tgcggctgcg gaggcggtca acgtctcggc ggcgcagagc 3 00 

gtggaacaag acctgttcjgc cgcgatcaac gctcgcttcg agcggatttt tgggcgcccg 360 

ctgatcggtg atggcgccaa cggcgggccg ggacaagacg gcgggcccgg cgggttgctg 420 

10 tacggcaacg gtggcaacgg cggcaccagc acgaccgtgg ggatggccgg cggcaacggt 480 

ggtgccgccg ggctgatcgg caacggtggg ttcgggggcg gcggcgggcc cggcgcggcc 540 

ggcggcaacg gcggcgccgg cgggtggcta ttcggcaacg gcggcgccgg cggtgccggc 600 

ggcctcggcg tagcgcccgg cgtgcccggc ggcgccggcg gtgccggcgg cgccggcggt 660 
gtcggcggac ccgc.cgggtt gtggggccac gggggtgccg gcggggcggg tggtgccggc - 720 

15 gtggctggcg ccggcggctt cgaggggacg atcggtgccg gcggtgccgg cggtgtcggc 78 0 

ggtgccggcg gtgtcggcgg tgccggcggt gccggcgggt ggctgtacgg cgacgccggt 840 

gccggtgggg atggtggfcgt cggcggtgcc ggcggcaccg gcgggttagg caaccgtggc 900 

ggcgccggtg gcgccggggg cgccggtggt gtcggcggcg ccgggggtgc cgccgggctg 960 

tggggcggcg gtggtgccgg cggggtgggt gggaccggcg gcggcgccgg cctcggtgct 1020 

20 cagagcgtca ccttcagtag tagcttaagt ggcctttccg gtggcgacgg cggcgccggc 10 80 

ggggccggtg gcgccggfcgg cgccggtggc accggtgggt ggctgtatgg cggcggtggt 1140 

gccgccggat ccggcgggga cggtggtacc ggcggtcagg gcggcgccgg cggcgccggt 1200 

gtatttagcc tattcggatc cggtggcggc cccggcggca acggcggcgt cggcggcgtc 12 60 

ggcggtgtcg gcggtgcfcgg cgggcgtgcc ggcttgttcg gcgtcggggg cctcggcggc 1320 

25 gcgggtggcg acgccggtiga ctccggcgaa ggcggcttcg gcgggccggg gctcgccggc 1380 

gggctgttcg gcaaccccgg caacggcggc gtcggcggga tcggcggcga cgccgcagcc 1440 

ggcggcgccg gtggggccgg aggcaacggt ggggccggag gcaacggtgg gtggttgttc 15 00 

ggcaatggtg gtgccggcgg ctccggtggc gacggcggcg ccgccggccg tggcggtgcc 1560 

ggcaacttgg gctcggccgg gggtatcaac gcccccgccg gtaaccccgg cagcggctcg 1620 

30 gtcggcatcg gcggtgccgg tggtgccggc ggcaccgccg ggctgttcgg cgacggtggg 1680 

gctggtgggg ccggtggfcgc cggcgccgcc ggcggcttcg gcggcatcag cgccgccacc 1740 

ccctcggcgg gcagtgaggg cgccatgggt ggggccggtg gtgttggcgg caacgccagg 18 00 

ctgttgggca ctggtggcgc cggtggagtc ggcggcggcg gcggggccgg cggcgacgga 1860 

ggccgcggcg gagtcgcaac ccccggcggt cagggcggtg acgctgggga cggtggcgcc 1920 

35 ggcggggccg gcggcaatgg cggcggcgcc agcggcgccg gcgggtggct gttggggacc 1980 

ggtggtgccg gtggtgccgg tggtaacggc ggcaatggcg gaaaagccgg ttttagccct 2040 

gggccgacca acttcggfcct caacggcgcc ggtggtggtg gtggtgtcgg cggcaacggc 2100 

gccaccggac cctggctgtt cggcgacggc ggccccaccc caggcagcac cggtgccggt 2160 

gcggccggtg gtcacggcgg cgacgcccag ctgatcggca acggcggcca cggcggggcc 2220 

40 ggcggcaccg gggtgccgaa cgggtcaggt ggtgccggcg gcctcagcgg gctgctgttc 2280 

ggcgagccgg gggcgaacgg gtag 23 04 
<212> Type : DNA 
<211> Length : 2304 

SequenceName : SEQ ID 538 

45 SequenceDescrription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 

50 <400> PreSequenc eSt ring : 

atgtcgtttg tgatcgcgaa ccccgagatg ctggcagcgg cggcgaccga tttggccggc 60 

atccggtcgg cgatcagcgc cgcgaccgcg gcggccgcgg ccccgacgat ccaggttgcc 120 

gcggccggcg ccgacgaggt gtcgctggcc atctcggcgc tgtttggcca gcacgcccag 18 0 

gcctatcagg cgctcagcgc ccaggcgacg atctttcacg accagttcgt gcaggccctg 240 

55 acctccggcg gcaacctgjta tgcggccgcc gagagccaca ccgtcgagca gatggtgctc 3 00 

aacgcgatca acgcgcccac ccagacactg ttcggccgcc cgctgatcgg cgacggcgcc 3 60 

aacgggaccg cggagaaccc ggacggccaa aacggcggcc tgctgttcgg caacggcggc 420 

aacggcttta cccagacgac cgccggggtg gccggcggca acggcggcag cgcggggttg 480 

atcggcaacg gcggggccgg cggcggcggc ggggccggcg ccgccggcgg cctcggcggc 540 

60 aacggcgggt ggctgtacgg caacggcggg gccggcggca tcgggggcgc gggcaccgga 600 

accggtggtc acggcggggc cggcggggcc ggcggccggg cctggctgtg gggcaccggc 660 

ggggccggcg gagccggcgg tgacggcggc tggttgttcg gcgacggcgg ggccggcggc 720 

accggcggca acggcggcag cggctttaac agcttgacct cttcggtcgg cggcgccggc 780 

ggggccggtg ggcacgccgg gctgttcggc gccggcggga ccggcgggac cggcggcatc 840 

65 ggcgggcaaa acaccgagac cggcccggcc gccagcaacg gcggcgcggg cggcgccggt 900 

ggcggcggcg ggtacctggt cggcgatggc ggcgccggcg ggaccggcgg ggccggcggg 960 

aagaattcca gcggtggccjc caccctcacc gggggcaccg gagggaccgg cggggccggc 1020 
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ggggcggccg ggtggctcta cggcagcggc ggcgccggcg gtgccggcgg cgccggcggg 1080 

ctcaacaacg ccggtgjcftgc caccggcggc accggcggta ccggcggagc cggcggctct 114 0 

ggagcgtggc tgtaccggcaa cggcggggcc gccggggccg gcggcaacgg cggcaacaat 12 0 0 

accagcgccg gcaccggtgg tgtcggggct agcggcggga ccggcggaaa cgccgggctg 1260 

5 atcggcgccg gcggccacgg cggggccggc ggcgccggcg gaaaccaaac cggtggcgtg 1320 

ggcaacggcg gggccggcgg gaacggcggc gccggcgggg ccggtggtca gctgtacggc 1380 

aacggcgggg acggcgcjcaa cggcggggcc ggcggggcca acatcgccgg cggcaatggc 1440 

agcgacggcg gcgccg-ccgg ccacggcggg gccggcggga gcgcccggct gatcggagcc 1500 

ggcggccacg gcggggacgg cggcgccggc gggaacaccg ccggcagaag ggccgacgcg 1560 

10 atcgccggca ccggcgggga cggcggcaac ggcgggaatg gcggcttgct aagcggcaac 162 0 

gccggggccg gcggcca.cgg cggggcgggc gggagcagca ccgcgaccac caccaccgga 1680 

acacccccaa cgggtgcaac gggcggcaat ggcggcaacg gcggggccgg cggcacggcc 1740 

gggtttaccg gcagcggcgg catcggcggc aacggcgggg ccggcggcac cggcggtaac 1800 

gccggtgtcg ccttgtcggt tggcagcacg ggcggactgg gcggfcaacgg .-oggcagcggg -I860 

15 ggcctcggcg gcggcggcgg gtcgctcttc ggcaatggcg gggccggcgg tgtcggcgca 192 0 

accggcggaa acggcggaag cggtatcggg cccgccagcg tgggtggcaa cggcggcaag 1980 

ggcggcgttg gtgcggccgg cgggcttgcc gggcagatcg gcaacggcgg tagtggtggg 204 0 

tccggcggtg ccggggcjcaa cggcgggacc ggcgataccg ccggcaacgg tggcaatggt 2100 

ggtgccggcg cggtcgcjcgg caacgcccag ctcatcggca acggcggcaa cggcggtggc 2160 

20 ggcgggaacg gcggaaccgg cgccgacggc acctaa 2196 
<212> Type : DNA 
<211> Length : 219 6 

SequenceName : SEQ ID 539 
SequenceDescription : 

25 

Sequence 



<213> Organi smName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenc eSt ring : 

30 atgccggggc ggttcagaaa cttcggtagc caaaacctgg gtagcggcaa catcggcagc 60 

accaacgtgg gcagcggcaa catcggcagc accaacgtgg gcagcggcaa catcggcgac 120 

acgaacttcg gtaacgg-aaa caacggcaac ttcaactttg gtagcggcaa taccggcagt 180 

aacaacatcg gcttcgcfaaa caccggcagc gggaatttcg gtttcggaaa cacgggcaac 240 

aacaacatcg gtatcgcjcfct caccggcgat ggtcagatcg gcatcggcgg actgaactcg 3 00 

35 ggcagcggaa acatcggrttt cgggaactcc ggcaccggaa acgtcggttt gttcaactcc 360 

ggcaccggca acgtaggctt cgggaactcc ggtactgcga acactggatt cgggaacgcg 420 

ggcaacgtca acaccggatt ttggaacggc ggcagcacaa acactggcct cgctaacgcc 480 

ggcgccggca acacaggctt tttcgacgct ggcaactaca acttcggcag tcttaacgcc 540 

ggaaacataa actcgagttt tgggaattcg ggtgacggca acagtggttt cctcaatgct 600 

40 ggcgacgtca actccgg^tgt gggcaatgcg ggtgatgtca acactggctt agggaactcg 660 

ggcaacatca atactggfcgg gtttaatccg ggcacgctca acacgggctt cttcagcgcg 720 

atgacccaag ctggtccgaa ttcgggcttc ttcaacgccg gtaccggtaa ctctggtttc 780 

gggcacaacg acccggctgg cagtggcaac tcgggcattc agaactcggg cttcggcaac 84 0 

tcgggctatg tcaataccag caccacaagc atgttcggcg gtaactcagg ggtgctcaac 900 

45 acgggctacg gcaactcagg tttctataac gcggccgtca acaacaccgg gatttttgtg 960 

accggcgtga tgagttcggg atttttcaat tttgggacgg gcaactcggg cctgctggtc 1020 

agcggcaatg ggctttccjgg tttcttcaag aacttgttcg gatag 1065 
<212> Type : DNA 
<211> Length ; 106 5 

50 SequenceName : SEQ ID 540 

SequenceDescription : 

Sequence 



55 <213> Organ ismNaxne : Mycobacterium tuberculosis H3 7Rv 
<4 00> PreSequenc eSt ring -. 

atgtcgttcg tgctggcgat gccggaggtg ttggggtcgg cggcaacgga tctggccgct 60 

ctgggctcgg tgctgggcgc ggccgatgcg gccgcggcgg ctacgacgac gggcatcgtg 12 0 

gccgcggccc aggatgagjgt gtcggcggcg atcgcggcgt tgttttccgc ccacggccgg 180 

60 gcctatcagg tggccagfccgc gcaggcggcg gcggttcacg cccagttcgt ggaggcgttg 240 

agcgcgggtg cgggggccta cgccagcgcg gaggccgccg gcgcggcggt gctggccaac 3 00 

ccggcgcaga gcgtgcagca ggacctgctg gccgccgtca atgcgcaaag tgtcgcgctc 360 

acggggcgcc cgttgatccjg caacggcgcc aacggggccc cgggcacggg ggccaatggt 420 

gcgccgggcg ggtggttcfct cggtaatggt ggggccggcg ggtccgccgc cgctggctcg 48 0 

65 ggcctgcccg gcggggccgg cggggccgcc gggttgttcg gcaccggcgg ggctggtggg 540 

gccggcggga gttccacggt aggtgatggc gaggccgggg gtgccggtgg atcaggtggc 600 

tggttgttgg gcaccggtgg ggtcggcggg gtcggcgggc tcggggccgg cgccggtggg 660 
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gccggcgggg ttggtggggc cggcgggctg ttgggtgctg gcgggcacgg cggcgccggc 720 

gggctaggcg ccgtcaccgg tggggtcggg ggaactggcg gagccggtgg gctgctggcc 78 0 

gggctgctgg ccgggccggg cggggccggc gggaccggcg gacgtggcfct tctcaacaac 840 

ggtggggtcg gtcjgggctgg cggcaacgcc gggctgctgt tcggtgccgg cggcaccggt 900 

5 ggatccggcg gagccggcct aggtggtgac ggtggggccg gtggggccgg cggcaacacc 960 

ggtgtgctgt tcggcaacgc cggatccggg gggaccggcg ggttcggcga taccgacggg 102 0 

ggagccggcg gtgccggcgg tgacgccggc tggttgggct ccggtggggt cggcggggcc 108 0 

ggcgggttcg gcgaaaccgg tgacgggggt gtcggcgggg ccggcggcaa ggccgggttg 1140 

ctgatcggta accjgcggggc cggcggcgcc ggtgggcaag gcgccgtgac cggcggtacc 1200 

10 ggcggggccg gcg-gcgacgg ggtgctgatc ggcaacggcg gcaacgccgg catcggcgga 12 60 

accggaccga ccgcgggtga taccggcgcg ggtgggatca gtgggctgct gctgggcgcc 1320 

gacggcttca acaccccggc cagcgcctct ccgctgcaca ccctgaaaca acaggcgctg 13 80 

gccgcgatca acgcgccgac ccagacactg accgggcgac cgctgatcgg caacggcacc 1440 

,cccggggcgg tccggcagcgg ggccaccggg gcccccggtg ggtqgetgct cggcgacggc 1500 

15 ggggccggcg ggfcccggcgc ggcgggctcg ggcgcgcccg gcggggcggg cggggctgcc 1560 

gggctgtggg gtaccggcgg ggccggcggg gccggaggca gctcggcggg tggcggcggg 162 0 

gccggtgggg ccggcggggc cggcggctgg ctgctcggcg acggcggggc cggcgggatc 1680 

ggcggagcca gcaccgtact cggcggcacc ggcgggggag gcggggtcgg tgggctgtgg 1740 

ggcgccggtg gggccggcgg ggccggtgga accggccttg ttggtggcga cggcggggcc 18 00 

20 ggtggggccg gccjggaccgg cggactgctg gccgggctga tcggtgccgg cggaggtcac 1860 

ggcgggaccg gccjggctcag cactaatggc gacggcgggg ttggcggggc cggcgggaat 192 0 

gccggaatgc tcgccgggcc gggcggcgcc ggcggagccg gcggtgacgg cgaaaacctg 1980 

gacaccggtg gggacggcgg ggccggcggt agcgcagggc tgctgttcgg cagcggcggc 2 040 

gccggcggcg ccggcggatt tggtttcctc ggtggggacg gcggggccgg tggcaacgcc 2100 

25 gggctgctgt tgtccagcgg cggggccggc gggttcggcg ggttcggcac cgccggtggg 2160 

gtcggtgggg ccggcggcaa tgccggctgg ctgggcttcg gcggggccgg tggcgtcggc 222 0 

ggcagcgccg ggctgatcgg caccggcggc aacggcggca acggcggcac cggcgccaac 22 80 

gccggcagcc ccggaaccgg cggcgccggc gggttgctgc tgggccaaaa cgggctcaac 2340 

gggttgccgt ag 23 52 

30 <212> Type : DNA 

<211> Length = 2352 

SequenceName : SEQ ID 541 
SequencerDescription : 



35 



Sequence 



40 



45 



50 



55 



60 



65 



<213> OrganismName r Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

gtgtcgttgg tgatcgcgac gccgcagctg ctggcaactg 
attggttcgc aggtgagcgc ggctaatgcg gccgcggcga 



gctgcggctg ccgatgaagt 
cagtatcagg cgctcagcgt 
actgcggccg cgggtcggta 
gcggtgaatg cgcccaccga 
gggacggcac ccgggcagcc 
ggcgcggctg gcgggttcgg 
aacggcggca acggcggggc 
gggtggttgt ggggcaacgg 
atcgggggtg cgggcggtaa 
ggtaccggcg gcgccggcct 
gccagcaccg gggacagccc 
ggcggcacgg gcggccatgg 



cggctttgga 
tgccgacgac 
tgttcggggc 
agcagtttgt 
ttgagcggag 
tgatcggaaa 
tgtttggcaa 
gcgcggccgg 



gtcggcggcg attgcggggt 
acaggtggca gcgtttcacg 
tgccagcact gaggccgctg 
ggcgcttttg gggcgcccgt 
tggcgcggcc ggcgggttgc 
tcaaaccggc ggcagcggag 
cggtggtacc ggcgcggccg gcggtgccgg 
cggcaacggc ggtgtcggcg gcaccagcgt 
cggcggcaac gccgggctgt tcggccatgg 
cgccggggca aacggggtca atcccacgcc 
ggcagatgtg tccggcatcg gtgatcaaac 
cactgccggc acgccgaccg gtggcaccgg 



gccaccgcga cggcaggctc gggcaaggcc accggcggtg ccggtggtga 



gccgctgccg gtggcggcgg 
gcgagcgcct ttcjgcggtga 
ggtggtagcg gcggcgccgg 
ggtggtagcg gcggtttcgg 
ggcgcagggg gagctggcgg 



cggcaacggc ggcgacggcg gagtcgcgca 
tggtggcaac gggtccgacg gtgtagccgc 
aggcggcgct ttcgtacaca tcgccactgc 
tggtaacggg gctgccagtg ccgcctccgg 
caatggtggc gccggcgggt tgctattcgg 



aacggtggcg ccggtggcgc gggtggtatc ggtggtgacg gcgccacggg 



ggaagcggcg gcaacgctgg 
gatgtggtcg gcgrgcaaggg 
ggcgccggcg ggaccggcgg 
aacggcggca acggcggcaa 
ggggttggcg gtaacggcgg 
ggtgtctggg cggtcggtgg 
gtcggcgggg tgggcggagc 
atcggcaacg gcggcaacgg 



catcgcgagg tttgacagcc cagaccccga 
tggtgatggc ggcaagggcg gcagcggcct 
cgcgggcggc aacggcggcg ccggcgggtt 
cgccggggcc ggcggggatg gcggcgccgg 
cggtggtggc accgcgacgt ttcacgaaga 
cgtaggtggt gatggtggct ccggcggcag 
cggtggcgtg ggtggcaagg gtggcgccag 
tggcagcggc ggagtcggtg gggccggtgg 



tttagcgagt 
ggaagtggtg 
ccatgctcgg 
gcaggcgttg 
tctgctgggt 
cggcgccgac 
cggtggcaac 
gttgatcggc 

tgggaacggg 

ggccgcaggc 
cggcgccggt 
cggccccgcg 
cggcggcgac 
cggcgacggt 
cggcggtacc 
gggcgacatt 
cggcagtggg 
cacctctacc 
cgccgacggt 
tgatggcggc 
ggggcccggg 
ggcagaaccc 
tggcgtcggc 
gttgttcggc 
cgttgccggt 
cccggtcgct 
ctcgcttggt 
cggcatgtfcg 
agtcggcggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
144 0 
1500 
1560 
1620 
1680 
1740 
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10 



gctggcggtg acggcggcaa cggcggctcc ggtggcaacg ccagtacttt 
aactccatcg gcggggccgg cgggacgggc ggcaacgggg gcaacggcgc 
aacggtggcg ctggcggtat tgccggcggt gcgggtgggt ccggagggtt 
gccgcaggag tcagcggcgc tgacggtatc ggtggcgcgg gcggcgcagg 
ggcgcgggcg gtagcggcgg tgaggcaggc gcggggggcc tcaccaacgg 
cctggcgttt ccggcaccga aggcatggcc ggcgcgcccg gctag 
<212> Type : DNA 
<211> Length : 2085 

SequenceName : SEQ ID 542 

SequenceDescription : 



tggcgatgag 
aaacggcggt 
cctcagcggt 
cggtgccggt 
ccccgggtcc 



1800 
1860 
1920 
1980 
2040 
2085 



Sequence 



Origan ismName : Mycobacterium tuberculoses K3 7Rv 
PreSequenceString 



<213> 

15 <400> 

atgtcgtttg tgattgcggc gccagaggtt 
ctcgagtcga gcatcgccgc ggccaacgcg 
gccgcgggtg ccgatgaagt ctcgacggcg 
gcctatcagg cgctcagcgc ccaagcgcag 

20 acctccggtg gcggcgcgta cgccgccgcc 
ccgatcaacg agttcttcct ggcgaatacc 
ggcgcccccg gcaccggggc caatggcggg 
gccggaggat ccggcgcggc cggcgtcaac 
gggctcatcg gcaacggcgg cgccggcggc 

25 ggcgccggcg gcgccggcgg ggccgccggc 
cccggcggat tcgcagccgc tttcggcgcc 
ggcgggctgt tcgccgacgg cggggtcggt 
ggcggggccg gcggatccgg cggaaacggc 
cccggcggat tcggcatctt tggcggcggc 

30 ttcggcgccg gcggcaccgg cggtagtggc 
ggtggggccg gcggcgacgc cggcatgctc 
agcggcggat ccaacccgga cggtggcggc 
acgttgttcg gctccggcgg cgccggcggc 
gcgggcggcg ccggcggaaa ggcgggtctg 

55 ggcggcgggt ccttcgcggg cgccggcgga 
gtcggcaacg ccggcaacgg cggcaacggc 
ggcggcgccg gcggaagcgg cgtgctgatc 
actggcgcac ccgccggcac ggccggcgcc 
gacggattca acgcccccgc cagcacgcca 

40 gcgatcaacg agcccaccca agccctcacc 
ggcactccgg ggaccggggc cgacggtggg 
aacggcggcc acggagcgac cggcgccgac 
ggaatcctgt caggtattgg cggcaccggc 
ggcgggaccg gcgggaccgg cggagccgcc 

45 agcggcgggt ttggcctgga caccggcggg 
ttcctcggtg ccgcagggac cggcggccag 
ggcggcaccg ccggagccgg tggtaccggt 
gccggcgggt ttggcgcgaa tggcgggacc 
ggcaccggcg gggcgggcac gctcggcgcc 

50 ttcggcgccg gcggcactgg cggtgctggc 
ggcggcagcg gcggcaacgc cggcctactc 
agcggcggca gcgccttaaa tgtcggcggg 
ggcggctcgc tcttcggctt cggcggtgcc 
agcagcggcg gcaccggcgg tgacggcggt 

55 ggcggcgccg gcgggtttgg cgctgacacc 
gtgctgatcg gcaacggcgg caacggcggc 
gccggcggca ccagcgggct gatcatcggc 

<212> Type : DNA 
60 <211> Length : 2514 

SequenceName : SEQ ID 543 
SequenceDescription : 



atcgcggcag 
gccgcggcgg 
gttgcggcgc 
gcgtttcatg 
gaggccgccg 
gggcgcccgc 
gacggcggct 

ggcggggccg 



cggcaacgga 
ccaacaccac 
tgttcggcgc 
cccagttcgt 
ccacctcgcc 
tgatcggcaa 
ggttaatcgg 
gcggcaacgg 



gccggcggga gggccagcac 
atgctgttcg gggccgccgg 
accggcggcg ccggcggggc 
ggcgccggcg gggcaaccga 
gggctgttcg gcgccggcgg 
gccggcgggg atggcggctc 
gggacgagca taatcaacgt 
agcctcggtg ccgccggcgg 
ggtgccggtg ggatcggagg 
gtctgcgggc tgggcttcga 
ctcatcgggg cgggcggtgc 
accggtgggg ccggcggcgc 
ggcgccagcg cgaacggcgc 
ggcaatggcg gtaacggtgg 
ggcggcttag gcgggcagct 
ctgcacaccc tgcagcaaca 
gggcgaccgc tgatcggcaa 
gccggcgggt ggttgttcgg 
ggcggagacg gcggatccgg 
gggtccggcg gcatcggaac 
ttgctgatcg gctccggcgg 
gccggcgggc gcggcggtga 
gccgccctct cccaaaactt 
gggctgttcg ccaacggcgg 
ggggggaacg gattgctctt 
gacggcggcg ccggggggca 
ggcagcagcg gcggcacctt 
gccctcggcg cctccggcgg 
accggcgggg tcggcgggaa 
ggcggcaccg gtggctccag 
acggccgggg tgttcggcaa 
ggtgggaatt cgagctcggt 
aacggcggga aagccggcgg 
gagaacgggc tcaacggcct 



tttagccagt 
agcactgctg 
ccacggccag 
gcaggcgttg 
gctgctcgcc 
cggcaccaac 
caacggcggc 
cggcgccggc 

ggggaccggc 

ggtcggcggt 
cggcggaaac 
cgccggcacc 
caccggtggg 
tggcgggctg 
cggcgggaac 
cgccggcggc 
cgacggcggc 
cgccggcggg 
cggcggcgcc 
gcccgggctc 
cggcgccgcc 
tagtggcggc 
actaggtcga 
aatcctcaac 
cggcgccaac 
caacggcggc 
cggggccggc 
gacgggccaa 
caccggcggc 
cgccggcttg 
tatcggtgcc 
ggccggcggg 
cggcgccggc 
cggcgggctg 
cggcggcaac 
cgccggcggc 
cggcggcagt 
cggcatcggc 
cggcggcgac 
ccccaacgcc 
cacacccggc 
gtaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2514 



65 



Sequence 

<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<4 00> PreSequenceString : 
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atgtcgtttg tgatcgcggt gccggagact 
ctcggctcga cgatcgctgg ggccaacgcg 
gccgccggtg ccgatgagat ctcggcggca 
gcctatcagg cggcgagcgc cgaggcggcg 
5 accaccgggg ggggcgccta tgcggccgcc 
tcgatcaacg cgcccgtcct ggccgctacc 
ggtgctcccg gcaccggggc caacggaggg 
gccggcggat ccggtgcaaa gggcgccaac 
gggctgttcg gcaacggcgg ggccggcggt 

10 atcggcgggg ccggtggcgc tggcgggtcc 
ggcgccggcg gggctgcgac gfcctcttgtc 
ggcaacgccg gtatgctcgc cggcgccgcc 
agcactgccg gtggggctgg cggcgccggc 
gtcggcggcg ccggtgggca gggtcacacg 

15 gggttgtttg gtgccggcgg catgggcggg 
ggcaccggcg gggccggcgg ggacggtggg 
99 c 9gggcag gcgggtcagg actgaccacc 
gggacgcttt ccctgggtgc cgccggtggc 
gtcttcggtg gtggtaaggg cggcgccggc 

20 ggctccggcg ggggtggcgg caccggcggg 
gtcggcggca gcgccggcat gctcagcggc 
gggggccccg cgggcaccgc cgcaggtggg 
atcggcaacg gcggcaacgg cggcaacggc 
ggggccggtg gaaatgccgt gctgatcggc 

25 gccggcaagt ccggcttcgg cggcttcggc 
gctcccgaga gcacctcgcc atggcacaac 
gaacccaccg aggcattgac cggacgcccg 
ggaaccgggg acgatggcgg tgccggcggc 
gccggtgcgg ccggcaccaa cggcagcgcg 

30 tttggcaccg gtggcgccgg cggggccggc 
ggtggcgccg gcggatccgc cfctcttgatc 
gccgccacca ccaccggcgg cgtcggcggg 
gcggctgggc tcggcgggtg tggcggcggc 
gccggcggga ctggcggcgc tgccgggfctg 

35 gggaccggca gcaccgccgg gggcgccggc 
cacgggggaa ccggcggacc aggtgggaac 
ggtgccggcg ggcccggtgg gctgtacggt 
gggggcatgg ccggcggggg tggcggtgta 
gcgtcgggcg gtgccggcgg cagcggcggc 

40 .ggcgccggcg gcagcgcggg afctgttctac 
tacagcctca atggcacfcgg cggtgatggc 
cttcgcagcg gcttcggcgg cgccggcggc 
gggaacggcg gcgccggcgg caaggccggg 
ggcggggatg gcgccaccag cggcaagggt 

45 aacggcggca acggcggcaa tgccggaaaa 
ggtgggctgg tactcggccg ggatgggcag 
<212> Type : DNA 
<211> Length : 2745 

SequenceName : SEQ ID 544 

50 SequenceDe script ion : 



atcgcggcgg cggcgacgga tctagccgat 60 

gctgcggcgg ccaacacgac gagcctgctg 120 

atcgctgcgt tgttcggcgc gcacggccgg 180 

gcgtttcatg gtcggttcgt gcaggcgctg 24 0 

gaggccgccg ccgtgacgcc gctgctcaac 3 00 

ggccgtccgc tgatcggtaa cggggctaac 3 60 

gatgccggct ggttgatcgg caacggtggc 42 0 

ggcggggctg gtggccctgg tggggccgcc 48 0 

gccggcggaa ccgccaccgc caacaacggg 54 0 

gccatgctgt ttggggccgg cggcgccggc 60 0 

ggtggcatcg gcggtaccgg cggaaccggc 660 

ggggccggcg gtgccggcgg gttcagcttc 72 0 

ggggccggtg ggctgttcac caccggcggt 780 

ggcggggcgg gcggcgccgg cggggccggc ' 840 

gcgggcggat tcggggatca cggaacgctc 90 0 

ggcggcggtt tgttcggcgc cggcggggac 960 

ggcggcgctg ccggtaacgg tggtaacgcc 1020 

gccggcggca ccggtggggc tggcggcact 1080 

ggagccggcg gtaacgccgg catgctcttc 1140 

ttcggcttcg ccgccggcgg gcagggtggg 12 00 

tccggcggct ccggcggtgc tggcggctct 1260 

gcgggtgggg cgggtggggc gcccgggttg 1320 

ggcgagagtg gcggcaccgg tggtgtcggc 13 80 

aacggcggcg agggcggcat cggcgcgctc 1440 

gggttgctgc tgggcgccga cggatataac 1500 

ctgcagcagg acattctcag tttcatcaac 1560 

ctgatcggta acggcgacag tgggacgccg 1620 

tggttgttcg gcaacggcgg caacggcggg 168 0 

ggcggcgccg gtggggcagg cgggatcctg 1740 

ggcgtcggaa cggcgggtgc cggcggggcc 1800 

ggttccggcg gtaccggtgg tgtcggcggg 18 60 

gccggcggga acgccggctt gctcatcggc 192 0 

gctttcaccg caggcgttac cactggcggc 198 0 

ttcgccaacg ggggggccgg cggcgccggc 2040 

ggggccggcg gggccggcgg gctgtacgcc 2100 

ggcggctcca cgggggccgg agggacaggc 2160 

gccggcggct ctggcggggc cggcggtcat 2220 

ggcggcaatg ctggctcgct 'caccctcaat 2280 

tccagcctgt caggcaaggc cggtgctggc 2340 

ggctccggcg gggccggcgg caacgggggc 2400 

ggcaccggtg gggccggcca aatcaccggc 2460 

gccggcggcg ccagcgatac cggcgccggc 252 0 

ctgtacggca acggcggtga cggtggcgcc 2580 

ggagccggcg gcaacgccgt ggtgatcggc 2640 

gccgggggca cggcgggtgc cggcggcgcc 2700 

cacggcttga cgtag 2745 



Sequence 



<213> OrganismName : Mycobacterii 
55 <400> PreSequenceString : 

atgtcgttgg tgatcgtaac cccggagacg 
atcgggtcat cgatcggtgt agccaacagc 
gccgcgggcg ccgatgaggt gtcggcggcg 
gagtatcagg cgatcagcac gcaggtggca 
60 agcgccgcgg tcggctcgta tgtcagcgcg 
ctggagcaca acgtgctcaa tgccctcaat 
ttgatcggtg atggggcggc tggagcaccc 
atcttgtggg gcaacggtgg ggccggcggc 
ggcggggccg ccgggttgtt cggcaccggc 
65 gccggtgggg ccgggggtag cggcggctgg 
ggcgggcaga gcctgctggg cggggcaacc 
ggggtcggcg gaaccggcgg gcccggcggg 



im tuberculosis H37Rv 

gtggccgcgg cggcctcgga tgtggcgcgc 60 

gcggcggcgg ggtcaaccac cagcgtgctg 12 0 

atcgcgacgc tgtttggcag ccatgctcgg 18 0 

gcgtttcatg accgatttgc gcagacgtta 24 0 

gaggcgacca acgccgcacc gttggcgacg 300 

gcgcccaccc aggcgttgct gggtcgcccg 3 60 

ggcaccgggc aggccggcgg ggccggcggg 42 0 

tcgggcgcgc ccggccaagt cggcggggcc 48 0 

ggggccggtg gggccggtgg ggccggcgcc 540 

ctgctgggca atggtggagt cggcggggcc 60 0 

ggcggggccg gcggcaacgc cggactgttc 660 

cccggcgggc ccggcggggt cggcggtacc 72 0 
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ggtggtgccg gtggcctggg cgggaccctc tacggggccg gcgggcacgg aggtgccggc 78 0 

gggcccggcc cgatcggtgg tgtgggtgga cacggjcggtg tcgggggtgc ggccgggctg 840 

ttgggcgtgg gcgggcacgg tggtgccggc ggacacggcg cggagggtgt ggccggcgca 900 

gccggtgagg acttgtcccc gcacggtacg tccgcjtgggg tcggcggcga cgccggcgat 960 

5 ggcggcaccg gagggcgggg cggctggctg gccgcjcgccg gtggggccgg cggggccggt 102 0 

ggggttggcg ggaccggcgg ggccggcggg gccgcjatttt ctcgtgcctt gattgtcgct 10 80 

ggggataacg gcggtgatgg tggtaacggc gggafcgggcg gggctggcgg ggctggcggc 1140 

cccggcgggg ccggcggcct gatcagcctg ctggcgcggcc aaggcgccgg cggggccggc 12 0 0 

gggaccggcg gcgctggtgg cgtcggcggt gaccgcggag ccggcggccc cggcaaccag 1260 

10 gccttcaacg caggtgccgg cggggccggc ggccatggcg gtgaccccgg cgccggcggg 13 2 0 

gccggaggca ccggcggagc cggctccatc accgcjcgctc agggcgccat cggcgccacc 13 80 

cccaccagcg gcggcaacgg cggggccggc ggcaacggcg ccaacgccac caccgcgggc 1440 

accaacggcg ccaacggcgcj acccggcggc catgcjcgggc tggtcggcaa cggccjgcgcc 150 9_ 

ggcggcaacg gcgccaacgg cgccgcaggg acgaacgcga gcgattcggg cgcagtcggt * 1560' 

15 ggcaaaggca atagcggcgg caacggcggc caggcjcggcg ccggcggcga cggcggaact 1620 

ctcgccggca atggcggggc cggtgggact ggcggccgcg gcgcagacgg cgggctcggt 168 0 

ggtagtggtg ccgagggtgc caatgccacc accgcgggcg agcgtggcca ggacggtggt 1740 

aaaggcggca acggtggggt cggcgggacc ggcggcaacg ccgtcgcacc cggtgccaac 18 0 0 

ggcggccatg gcggcaacgg cggcaatcct ggcttcagcg gcgctggcgg gctcgggggg 1860 

20 ctcagcggtg acggcgtgac ccgcgccgcg cagggjtgcga cccccgactt tgcagacacc 1920 

gggggcaaag gcggcaacgg cggtaacggc gccaacgccg ttgcacccgg cggcaccggc 1980 

gccagtggcg gagccggcgg gaacgctggg gctgcgcggca agggcgggga aaacatcatc 2040 

ggcgatggcg gcggcggaaa cggcggggcc ggcgcjcaagg gcggcgccgg caccctgcta 2100 

ggtctcacgg tatttggcga caatggcggc gccggcgtcc tcggcgactc gacggaccca 2160 

25 gatggcagcg gcggtgctgg tggcgcgggc ggcgccggtg gcgccggtgg cgatccaacc 2220 

atctga 2226 
<212> Type : DNA 
<211> Length : 2226 

SequenceName : SEQ ID 545 

30 SequenceDe script ion : 

Sequence 



35 



40 



45 



50 



55 



60 



65 



atcggcaatg 
tacggcaacg 
ggttccgcgg 
ggcggggccg 
ggcccaacgg 
ttgatcggct 
gccggcggca 
ggcggatcga 
ttccttggcc 
ggtggcgccg 



gcgcgaatgg 
gcggcaacgg 
gcctgatcgg 
gcggcgccgg 
acgttcctgc 

ggggcggcaa 

acggcggcgc 
gcgaagacgt 
tgggcggtga 
gcggcaccgc 



cggcgcagaa 
cgtccacgac 
tgttcagtga 
acagcttcgt 
acgcttcggc 
cgttgttcgg 

gcggtgatgg 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgtcgtttg tcacggcagc tccagagatg ctggcgacgg 
atcggcacat cgctgagtgc ggcaaacgcg acggcagcgg 
gcggccggag ccgacgaggt atcgcaggct atcgcaaggc 
cactatcagt cgctgaacgc tcaagccgcg gcatttcatc 
aacgccgccg gtggcgccta ttcgagcgcc gaggcggcca 
gaacagaatc tgttggccgt gatcaatgcg cccgcccagg 

aacagcggcc acjccccaacg 
cttctcccaa acgaccgccg gggtggccgg 
caacggcggc aatggtggcg ccggtggggc 
cggatggctg ctcggcaacg 
cggcacaggt ggagccggcg 
cggcgggccc ggcggtttcg 
cagcggttcg ctctfctggcg 
cggcggcacc ggcgcjggccg 
tggcggcgcc ggcggcacca 
gggaggtcga ttgttcagcc 
cggatccaac gccggtgacg 
ccagggcggc tccggcggcc 
gggcggcgcc ggcgctgtgc 



ggtggtgccg gtaccgcaat 
agcgccctga tcggctacgc 
accggcggcg acggcggcct 



ggtttcggcg gcctcggtgg cggctccaac ggcaccgggg 



acgggcgcca cgctgatcgg 
gtcaacgtgg gcaacggcgt 
atcggcctgg gcgccggcgg 
ggcaatggcg gtgacggcgg 
ggcggcaacg ccggcaacgg 
gtcattgcca acggctcctt 
ggcgtcaatg gcggcaccgg 
ggaccgtcgt ag 
<212> Type : DNA 
<211> Length : 1632 

SequenceName : SEQ ID 
SequenceDescription : 



cctgggcgcc ggcggcggcg 
cggcggtctg ggcggccagg 
tgccggcggt gccggcggcg 
tgacggtggc ggcctgttta 
cgccatgcct gccaatggcg 
tgccccgtcg ttcgtcggct 
cggcagcggc gggatccttt 



546 



gtggcgccgg 
gggccggcgg 
ctgcttttgg 
tcggcggcgc 
gcggcgctgg 
gcaacaacaa 
tgggcggtga 
gtggcgccgg 
tcggcggctt 
tcatcggcac 
gcgcgggcgg 
gcggcatcgg 
gcggccaggg 
ccacagtcgt 
gtatcggcgt 
gcaacggcgg 
tcggcggcaa 
ttggcgccaa 



tgtcgcgaat 
ctcggtgctg 
ttacgccacg 
gcaaacgttg 
gcaggcgttg 
gcgtcccctg 
tgggattttg 
cggcgccggt 
cggtgctgcc 
cggtcccggc 
cgacgcccca 
aaacggtggg 
cggcggcgtc 
ccgcggtcta 
cggcggtgac 
cggtggcaac 
cggtgacagc 
cggcgaaagt 

gggcgtcggc 

cgcgggcggc 
cgggttcgcc 
cgccgcgctg 
tggacttggt 
cggtggggac 
caacgccggg 
cggcggcaac 
cggcgcgaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1632 
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60 
120 
180 



Sequence 

<213> OrganismName : Mycobacterium tuberculosis H37Rv 
5 <400> Pre Sequenc eSt ring : 

atgtcgtatg tattggcgac gccggagatg gtggcagcgg cagcaaacaa tttggcgcag 
atcggctcga cgttgagcgc ggccaatgcc gcggcgctgg ccccgaccac cggagtgctg 
gccgcgggcg ccgacgaggt gtcggcagcg gtggcgtcgc tgttttccgg gcacgcccag 

gcctatcaga cactcggcac gcaggcggcc gcgtttcatg aacggtttat ccaggccttg 240 

10 agcacggctg cgggcgcata tggtagcgcc gaggccgcaa atgcctcccc gctgcagcaa 3 00 

gcgttgaatg tgatcaacgc gcccacgcag acgctgctcg ggcgccccct gatcggcaac 3 60 

ggcaccaatg gtgcgccggg taccgggcag gccggcgggc cgggtggctt gttgtatggc 420 

aacggcggca atggcgggtc cggcggggtc ggccaagccg gcggagccgg cggcagcgcc 480 

gggttgatcg gcatcggcgg gaccgggggg gccggagggg ccggcgcggt cggcggcgtc 540 

15 ggcggcaacg gggggtggtt atacggcaat ggcggggccg gcgggctcgg ggggaccggg 600 

gtggccggcg tcaatggcgg tatgggcgca gcaggaggtg ccggcggcaa cgcctacctg 660 

ttcggttctg gcggcgcggg cgggcagggc ggtatggggg cagccggggc agacggcgtc 720 

aaccccacac ccaccggcac cgctgatgcc ggcagtaccg gcaccgacca gacgctgggc 780 

ggcaacgcca taggcggtaa cggtggcccc ggcgatgccg gcgacgcgat gacgtccggc 840 

20 ggcgctggcg ggtctggcgg gaacgccgtc tcaaccgfcca acggcgacgc cgtgggcggt 900 

gagggaggca aagggggtga gggcgcctat ggcggcgctg gcggcgccgg cggcagcgcc 960 

gcttccatcg gcaatgcagc cataggaggt aacggcggtg ccggcgggaa cgcccaggcc 1020 

cccggcggtg tggggggcgc cggcggcgaa ggcggggatg cccaggtggg caccaattct 10 80 

cccagcaacg cggaagccgg taacggcggc agcggcggca acggcttcga cagcttcgct 1140 

25 tccggcggta ccggcggagc gggcggaacc gggggcgccg gtggccgcgg cgggctgctg 12 0 0 

atcggcgacg gcggcgccgg cggcgcgggc ggagtcggtg gtaccggtgg tagcggtgcc 1260 

cccggcggcg gcggcggcgc cgggggcgac ggcggtgccg ccaacaccga cagcgccggc 13 2 0 

agttcacgca aggcgttcgg gggcgatggc ggtgtgggcg gtgacggcgc gagcgccctc 13 8 0 

ggcaccggtg gcgaaggcgg tatcggcggc cagggcggta acgggggtgc tggcgggctg 1440 

30 ctcatcggca acggcggcgc cggtggtgtc ggcggcacgg ccggtgccgg tggtactggt 150 0 

ggttccggtg gtgctggagg tgccggaggc gccggtggtg gcggcaccaa cagtggtccg 1560 

ggcgcagcgt ttggcggtaa cggcaacacc ggcggcaacg gcggcaacgg cggcgccccc 162 0 

ggcgccctcg gcggcaaggg cgggtccggc gggctgattg gccgcgcggg cagcgacggt 1680 

ggcgttggtg ccggaggagc gggcggcgcg ggtggccjccg gcggcacagg cggtgagggc 1740 

35 ggcacgggcg gcgacgggaa aaccaccgac ggcaatcccg gcatgggcgg cagcccgggc 180 0 

agcgccggcc aacccggcta a 1821 
<212> Type : DNTA 
<211> Length ; 1821 

SequenceName : SEQ ID 547 

40 SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 

45 <400> PreSequenceString : 

atgtcgtttt tgtttgcaca gccggaaatg ctgggcgjcgg cggcgacgga tctggcaagc 60 

atcggctcgg cgatcagcac ggccaatgct gcggccgcgg ccgccacgac gcgtgtgctg 12 0 

gccgccggtg ccgatgaggt gtccgccgcc gtggcggcgc tgtttagcgg ccacgcgcag 180 

acctatcagg cgctgagaac tcaggcggcg gcgtttcacc agcagatcgt gcagaccctc 24 0 

50 acgagcacgg caggcgcgta tgccagcgcc gaggcccjcca acgtcgagca gcagctgctg 3 00 

ggtgcgatca acgcgccgac catggcgctg ctggggcgcc cgctgatcgg ccacggcgcc 360 

gacggggcgc cggggaccgg gcaggccggc ggggccggcg ggatcctgta cggcaacggc 420 

ggcaacggcg ggtccggcgc caccggtcag gccggcggtg cgggcggggc ggccgggctg 480 

atcggccacg gcggggccgg cggcctgggt ggcaccggcg cgtccggtgg tgccggcggg 540 

55 gccggcggat ggctgtgggg caacggcggg gccggcggca atggcggggt cggggtggcc 600 

ggcgaccccg gtggtgtcgg cggtgccggc ggtgccggcg gcgccgccgg attgtggggc 660 

agcggagggt ccggcggcac cggcgggcaa gggggggtcg gtggcggcaa gtccggcgac 72 0 

ggcggcacgg ggggtatcgg cggcgccggt ggcggtggtg gctggctgca cggcgacggc 78 0 

ggcgccggcg gacacggcgg gcagggcgga accggcgtca gctcaggagg caacggcggg 840 

60 gccggcggca ccggcggcga cggccgcggc ctgtcgggca gcggcggggc cggcgggcgc 900 

ggcgggcaaa ctggggttgg aggcaaagtc ggggagaata acttcggggg cgcgggcggt 960 

gccggcggta ccggcgggct catcggcaac ggcggtgccg gcggcaacgg cgggcaaggc 1020 

gcaattagcg gcgccggcgg ggccggtggc aatgcctggc tgatcggcga cggcggtgcc 108 0 

ggcggcaacg gcggtgatat ccgtggccag ggcggcggcg ccggtggagc cggcggcgct 1140 

65 ggtgggcaac tcattggcaa cggcggcacc gggggggcag gcgggaccgt caccagtccg 12 0 0 

aacggtcttg gcggtgctgg cggagccggc gggtccgccg gtctgattgg ccacggcggc 12 60 

accggcgggg ccggcgggca cagcgcccag gggccccjacg gcaatggcgg aattggtggt 1320 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



gccggcgggg 
ggcggcaagg 
gggcgaggcg 
ggcggcaccg 
gccgccgtcc 
aacgatggca 
ttcggccagc 



ccggtggcaa 
gcggcgacgg 
gtgccgccgg 
ccggcgagga 
taatcggcaa 
cccccggcaa 
ccggccaacc 



cggcggacaa 
cttcggcgtg 
cctgatcggc 
cggtaccggc 
cggcgggggc 
cggcgggggc 
cggtccgccc 



ctctacggca 
ttcggcaagg 
gacgccggga 
ggcaacggcg 
ggcggcgccg 
ggcggcgtcg 
ggccagcccg 



ccggcggcac 
gcggcgccgg 
ccggcgggac 
ggaccggcgg 
gcggaaacgg 
gcgggaccgg 
gccccgccta 



cggcggcacc 
cgggaccggc 
cggcggaaag 
aaacggtggg 

tggggccggc 

gggaaccctg 



<212> Type : DNA 
10 <211> Length : 1731 

SequenceName : SEQ ID 548 
SequenceDe script ion : 



Sequence 

<213> OrganisitiName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

gtgtggacgt cgcagatgat cgtggcgccg gcgtttgtcg 



gcaactattg gttcggcgat 
ttactgcctg cgggcgctga 
gggcaggcat accaggagct 
ctcatgagcg cgggggcggc 
caaatcgtgg gccaaaccgc 



atgcagcggc 
cgttggtccc 
cggcgctgtt 
tccatgagca 
cagccaactc 
ccgtgcagac 
ccgggcaaaa 



tagccgggcc aacgccgaag 
cgacgtgtcg gctgcgattg 
cagtgcccac gcggtcgcat 
ccagtacgcc agcgccgagg 
cctcgatgcc atcaattcac 
cgtccgctta tcggcaacgg tgccaacggg gtcgcaggaa 
ggcggatggc tatacggcaa cggtggcaac ggcgggtccg gcgggacggg 
ggcaacggcg ggtcggctgg gctatggggt agcggcggca atggcggcca 
ggtgccaacg gcgcagccgg tcaacccggg aaagcgggcg ggtccggcgg 
gccggtggat ggatctatgg tcacggtgga catggtgggg ccggcggaaa 
gccacagcgc ccgggggtgc gtcggcaggc ttcgatgggg gcgccggcgg 
tcgggtggtc gcggtggact gttgttcggc aacggcggca. acggctcggt 
ggaggacaag gcactaatga cacagccgga gattcggcccj gcagcggcgg 
aacgggggca acggcgccca gggcgggtgg ctgatcggca, acggggggca 
agcggcgccg gcggcggcac cgactccact caaacgggcg tcatgaacgg 
ggttcggccg ggatagccgg taacggcggt gacgcgggcc tagtcggcaa 
ggtggcaacg gcgggaatgg agcggccggg tctgcgctgg gtactaccat 
agcggcgggg tcggtggctc aggcggcgac ggcggcaacg gcgggtggtt 
ggcgcgtccg gcggcaacgg gggtcagggc ggtgacgcag gcaccaacgg 
tttggcggct ctgctggcgg tggcggctgg gtaggtgccg ttaacttcgg 
gtccagggtt ttgggttgtt tggtcacggc ggtgacggtg gcaacggtgg 
gccggcagcc tcagcattca atttggtgca tcagggggtg acggcggcca 
ctatacggca atggcggcaa cggtggtaac gccggcagcg gcggaggtac 
ggcagcgccg gccagggtgg cgccgccatt ctgatcggca acggcggggc 
ggggcgaccg gtgggaccgg agtaggcaac attattcagcj aagccggagg 
gacggtggcg cgggcggcag cggaggccta ctgttcggta gcggcggggc 
ggcggggccg gcggcgtcgg cgggtcgggc aacgacggcgj gcaacggcgg 
caaggcgggg caagcggcct ggggatcggc aacggcggac ccgggggaag 
ggtggggccg gcggaaccgg cggcagtgcg ggcactggcg gtgccggcgg 
aacgccgccc tgctgatcgg taccggtggc gacggcggag atggtgtccc 
gggggccaag gcggcaaggg cggattgatc ggtttgcctcj ggcagaacgg 



aaaggaccta 

gataacggcg 

cgcaacgcac 

gttcgtccag 

gtcaccattg 

gctgaccggg 

cggtggcgat 

ccaaaatggc 

gggcggggcg 

caacggcggc 

cgggggcaac 

aaacgggggt 

cggtggcatg 

attaggaggc 

aggtggggac 

cgcttccggc 

cggcggggcc 

cttcggcggg 

gttcggcagc 

atttgcgggc 

accgattagt 

tgacgttggg 

aggtggggtg 

cggcttcgaa 

cggagggaat 

tgacggcagc 

cggcggcatc 

cgatggtggc 

tggcggtacc 

tgacggtggt 

acccgcaccc 

gcagccgtag 



<212> Type : DNA 

<211> Length : 1920 

SequenceName : SEQ ID 
SequenceDe script ion : 

Sequence 



549 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

atgtcatggg tgatggtttc gccggagctg gtggtggcgg cggcagcgga tttggcgggg 
atcgggtcgg cgattagctc ggctaatgcg gcggcggccs tcaacacgac gggattgttg 
accgcgggtg ccgatgaggt gtcgacagcg attgcggcgt tgttcggtgc ccaaggccag 
gcctaccagg cggcgagcgc acaggcggcg gcgttttacg cccagttcgt gcaggccctg 
agcgccggcg gaggcgcgta tgcggccgcc gaggccgcccj ccgtgtcgcc gctgctggcc 
ccgatcaacg cgcaattcgt ggcggccacc gggcgcccgc tgatcggcaa cggcgccaac 
ggcgcccccg ggaccggagc caacggcggg cccggcgggt ggttgatcgg caacggcggc 
gccggcgggt ctggcgcccc cggcgctggg gccggcggta acggcggggc cggcgggctg 



1380 
1440 
1500 
1560 
1620 
1680 
1731 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



60 
120 
180 
240 
300 
360 
420 
480 
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ttcggcagcg gcggggccgg cggggcctcc accgacgtcg ccggcggggc cg-gtggggcc 540 

ggcggggccg gcggaaacgc cggcatgctg ttcggcgccg ccggggtcgg cgcjcgtcggc 600 

ggattctcga acggcggtgc caccggcggg gcaggcgggg ccggcggggc gg-gcgggctg 660 

tttggcgccg gaagggaacg cggcagcggc gggtcgggca acctcactgg cg-gggccggc 720 

5 ggggccggcg gcaacgccgg gacactcgcc actggtgatg gcggggccgg cgjggaccggc 78 0 

ggcgctagtc gcagcggcgg attcggcggg gccggcggag ccggcggcga cgjccggcatg 840 

ttcttcggct ccggcggctc cggcggcgcc ggcggcatta gtaaaagcgt cgjgggacagc 9 00 

gccgccggcg gggccggcgg ggcccccggg ctgatcggca acggcggcaa cgjgcggcaac 9 60 

ggcggcgcga gcaccggcgg cggggacggt gggcccggcg gggccggcgg ca_ccggcgtg 1020 

10 ttgatcggca acggcggcaa cggcggcagc ggcgggaccg gcgcgaccct ggjgcaaggcc 10 80 

ggcatcggcg gtaccggggg ggtgctgttg ggcctggacg gctttacggc ccccgccagc 1140 

acctcgcccc tgcacaccct- gcagcaggac gtgatcaata tggtgaacga ccccttccag 12 0 0 

acgctcaccg ggcgtccgct gatcggcaac ggcgccaacg, gcactccggg ga_ccggggct 1260 

gacggcggag ccggcggctg gttgttcggc aacggcggaa acggcgggca g^gaacgatc i« 13 20 

15 ggcggcgtca acggcggggc cggcggggcc ggcggggccg gcgggatctt gttcggcacc 13 80 

ggcggcaccg ggggcagcgg cgggcccggc gccaccggcc tcggcgggat tg-gcggggcc 1440 

ggcggagccg ccttgctctt cggctccggc ggggccggcg gaagcggtgg tgjccggcgcg 150 0 

gtcggtggca atggcggggc cggcggcaac gccggtgcgc tcttgggcgc cg-ccggggcc 1560 

ggcggggccg gtggtgccgg cgcggtcggt ggcaatggcg gggccggcgg tsacggcggg 1620 

20 ctgttcgcca acgggggagc cggcgggccc ggtgggtttg gcagccccgc tggggctggc 16 8 0 

gggatcggcg gggcaggtgg gaacggcggg ctgttcggcg ccggcgggac ccjgcggggcc 1740 

ggcgggggaa gcaccctcgc cggcggcgcc ggcggggcgg gcggcaacgg cgggctgttc 180 0 

ggcgccggcg gcaccggcgg cgccggcagc catagcaccg ccgccggagt ttccggaggg 1860 

gccggcgggg ccggcggcga cgccggcttg ctctccctcg gcgcctccgg cggggccggc 192 0 

25 ggcagcggcg gttccagcct gaccgccgcc ggcgtggtcg gcggcatcgg ccggcgccgga 198 0 

ggcttgctct tcggctccgg cggcgccggc gggagcggcg ggttcagcaa ctctggcaac 2040 

ggcggcgccg gcggggccgg cggcgacgcg ggtttgctcg tcggctccgg cggggccggc 210 0 

ggggccggcg cctccgccac cggcgccgcc accggcgggg acggcggggc cggcggcaag 2160 

tccggagcgt tcggtctcgg aggtgacggc ggcgccggcg gcgccaccgg tttgtccggt 222 0 

30 gctttccaca tcggcggcaa gggcggcgtc ggcggcagcg ccgtgctgat cggcaacggc 228 0 

ggcaacggcg gcaacggcgg taacagcggt aacgccggga aatccggggg tgcacccggc 2340 

cccagcggcg ccggcggcgc cggcgggctg ctgctcggtg agaacgggct gaacggcttg 240 0 

atgtag ~ 2406 
<212> Type : DNA 

35. <211> Length : 2406 

SequenceName : SEQ ID 550 
SequenceDescription : 



40 



45 



50 



55 



60 



65 



Sequence 

<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

ggccagagct atcaagcggt cagcgcccag gcggcggcgt ttcatgaccg 
ctgcttaacg ccggtggagg ttcatatgcg agcgccgaga ttgccaacgc 
ctgctgaacg cggtgaacgc gcccacccag acgctgctgg ggcgtccgct 
ggcgccgacg gggccagtgg tccggtggga cagcccggcg gggacggcgg 
ggcaacggcg gcaacggtgg cgacagcacg agccccgggg ttgccggggg 
tcagcggggc tgatcggcaa cggcggcagg ggcggcaacg gtgcgcccgg 
ggcaatggcg gcctgggcgg attgctgctg ggcaacgggg gtgccggggg 
accggtgaca acggtgtggg agacctcggt gctggcggcg ggggaggcga 
ggtgg ac ggg cggggctgat cggtcacggc ggtgccggcg gaaacggtgg 
cacggcggga gcggcaaggc cggcggcagt ggcggcagtg gcggcttcgg 
ggcgccggcg ggctgctgta cggcaatgga ggggcggccg gttccggcgg 
gatgcaggta ccggcgtctc cagcgacggt ttcgccgggt tgggcggcag 
ggcggcgacg cggggctgat tggtgtcggc ggcggcggcg gcggcaacgg 
gggctcggtg cgcgcctgtt ccaggtaggt agtcgcggcg gcgacggtgg 
tggctgtacg gcgatggcgg cggcggcggc gacggcggta atggggggct 
ggctccacca acgccggcaa cggcggcagc gcgcggctca tcggcaacgg 
ggtagcggcg ggagcggcgc gcctggctcg gtcagcagcg gtggcgtcgg 
aaccccggcg gcagcggtgg caacggcggc gtgtggtacg gcaacggtgg 
gccgccggcc aaggggggcc cggcatgaac accacctcgc ccggcgggcc 
ggcgggcacg gcggcaccgc catcttgttc ggcgacggtg gcgccggtgg 
gccggcggac ccggtactcc ggacggggcc gcggggcccg gcggcagcgg 
gggctgctgt tcggagtccc cggcccgtcc ggcccggacg ggtaa 
<212> Type : DNA 
<211> Length : 1305 

SequenceName : SEQ ID 551 



gttcgtccaa 
gcagcagaac 
ggtcggcgac 
coitcttgtgg 
agccggcggg 
cggtgcagga 
agtcggcggt 

tsgcggtttg 

ggacggtggg 
ccagttcggt 
caacggtggc 
cggtggccgg 
cggcgaccca 
ggtcggcggg 
gccatttatt 
tggtgccggc 
gggcgcgggc 
cgccggcggg 
cjggcggtgtc 
cjgctggcgcc 
cggcaccggc 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1305 
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SequenceDe script ion : 
Sequence 

<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString ; 

atggctcatt tttcggtgtt gccgccggag atcaactcgt tgcggatgta cctgggtgcc 60 

ggttcggcgc cgatgcttca ggcggcggcg gcctgggacg ggctggccgc ggagttggga 120 

accgccgcgt cgtcgttctc cfccggtgacc acggggttaa ccgggcaggc gtggcagggc 180 

ccggcgtcgg cggcgatggc cgccgcggcg gcgccgtatg cgggcttttt gaccacagcc 240 

tcggctcaag cccagctggc tgccgggcag gctaaggcgg tggccagcgt gttcgaggcc 30 0 

gccaaggccg cgatcgtgcc tccggccgcg gtggcggcca accgtgaggc gttcttggcg 3 60 

ttgattcggt cgaattggct ggggctcaac gcgccgtgga ccgccgccgt tgaaagcctt 420 

tacgaggaat actggg'ccgc tgatgtggcg gcgatgaccg gctatcacgc cggggcctcg '" 48 0 

caggccgccg cgcagttgcc gttgccggcc ggcctgcaac agttcctcaa caccctgccc 540 

aatctgggca tcggcaacca gggcaacgcc aacctcggcg gcggcaacac cggcagcggc 60 0 

aacatcggca acggaaacaa aggcagctcc aacctcggcg gcggcaacat cggcaataac 660 

aacatcggca gcggcaaccg aggcagcgac aacttcggcg ccggcaacgt cggcaccgga 720 

aacatcggct tcggcaacca gggccccata gacgttaacc fccttggcgac gccgggccag 78 0 

aacaacgtgg gcctgggcaa catcggcaac aacaacatgg gcttcggcaa caccggcgac 840 

gccaacaccg gcggcggcaa caccggcaac ggcaacatcg gjtggcggcaa caccggcaac 900 

aacaacttcg gcttcggcaa caccggcaac aacaacatcg gaatcgggct caccggcaac 960 

aatcagatgg gcatcaacct ggccgggctg ctgaactccg gcagcggcaa tatcggcatc 102 0 

ggcaactccg gcaccaacaa catcggcttg ttcaactccg cjcagcggcaa catcggcgtc 1080 

25 ttcaacaccg gagccaatac cctggtgcct ggcgacctca a.caacctggg cgtcgggaat 1140 

tccggcaacg ccaacatcgg cttcgggaac gcgggcgttc tcaacaccgg cttcgggaac 1200 

gcgagcatcc tcaacaccgg cttggggaac gcgggtgaat taaacaccgg cttcggaaac 12 60 

gcgggcttcg tcaacacggg gtttgacaac tccggcaacg tcaacaccgg caatgggaac 1320 

tcgggcaaca tcaacaccgg ctcgtggaat gcgggcaatg tgaacaccgg tttcgggatc 13 80 

30 attaccgaca gcggcctgac caactcgggc ttcggcaaca ccggcaccga cgtctcgggc 1440 

ttcttcaaca cccccaccgg ccccttagcc gtcgacgtct ccgggttctt caacacggcc 1500 

agcgggggca ctgtcatcaa cggccagacc tcgggcattg gcaacatcgg cgtcccgggc 1560 

accctctttg gctccgtccg gagcggcttg aacacgggcc tgtttaacat gggcaccgcc 1620 

atatcggggt tgttcaacct gcgccagctg ttggggtag 1659 
35 <212> Type : DNA 

<211> Length = 1659 

SequenceWame r SEQ ID 552 
SequenceDescription : 

40 Sequence 



50 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<4 00> PreSequenceString : 

atgtcgttcc tgattgcttc gccggaggcg ctagcggcga cagccacata tttgacaggt 60 

45 atcggttcgg caatcagcgc ggcgaacgcg gtcgcggccg ccccgacaac agagatcctg 12 0 

gcggcgggga ccgacgaggt gtccaccgcc atctcagcgc tgttcggcgc tcatgcccag 180 

gcatatcagg cgctcagcgc ccacgtggcg gcatttcacg accagttcgt gcataccttg 240 

accgccggtg ccggctcata catggccgcc gaggccgccg ccgcctcgcc tctgcaggct 3 00 

ttgcagctgg agctgctcaa cgccatcaat gcacccaccc tggcgctgtt gggacgcccg 3 60 

ttgatcggcg acggcaccga tgcggcgccg gggagcgggg gggccggcgg ggccggcggc 420 

atcttgatcg gcaacggcgg gaccggcggc gccagcgact tagccgggac cggccgcggc 480 

ggggtcggcg gggcgggcgg cgccggcggg ctcttcggca tcggcggcgc cggcgggggc 540 

tgcgggtccg cggtggcgat cgggggtgac ggcggggctg gtggcgccgg cggcgtgttc 600 

agcggcggcg gcgccggcgg ggccggcgac gccatcgggg gtagcggcgg cgcgggcggc 660 

55 accggtgggc tgttgggtgg tggcggcggc gcgggcggcg ccggcggcgc cggcggcaat 720 

ggcgggggcg ccagcaacag cgcaagtatc gggggtgacg gtgggtccgg cggcgcgggc 78 0 

ggcatgctct acggtgccgg cggcgtcggc ggcaacggcg gggccgcggt cgctatcggg 840 

ggtgacggcg gggccggcgg cagggccgga gcgatcggca acggcggtga cggcggcaac 900 

ggcgggactt ccaacacccc cggcggtagc ggcggcgacg gcggcaatgg cgggaacgcc 960 

60 ggactgatcg gcaacggcgg taacggcggc aacgccgaga ttgtcatctc cggcggtagc 1020 

gtcgccggca ccggtggcaa cggcgggttg ctgttgggct tcaacggcac gaacgggctg 1080 

ccgtag 10 86 
<212> Type : DNA 
<211> Length : 1086 
65 SequenceName : SEQ ID 553 
SequenceDescription : 
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Sequence 



<213> OrganisraName : Rickettsia prowazekii strain Madrid E 
<400> PreSequenceString : 

5 atgaaaaaat caaaaatttt aagaaaattt ttagcaacag cctcactatg tgggacatta 60 

ttcactaact ctaatgcaac aggaacaatt attcctaata atggtagcgt aagcttgaat 120 

actgacgcag gtcttgtagg aggagtattt aacaatggcg atattatcca aatagttaat 180 

993-gggcgtg aaattaaaat atcggcagat aaagcaaatg ctatcattgg aggaatcaat 240 

acattaaaag aactgcctga ttttggtggt gttgaagtaa gtcaaaacgt ctcaataggc 3 00 

10 cctcttaatg caggagaaga tcttaatact aattttggcc ctcttaaatt tattagtaat 3 60 

aatgttacgt caattattac aggagtcggt actaaaacat ttagtaafcat cgattttgct 420 

ggtaaaaatg ctactttaca aattaataaa gatttaaata ttacaactaa aatagataat 480 

acagtagccg gaaataatgg ttcaataaca tttgaaggta gcggtattat atcaaatcac 540 

ataggctaca ctaactctct tttaggaata aatgtaggaa acggagasigc* caagatttat 6*00 

15 gccccagaag caaataatat tacaattaat gctaaaaata taaatcttac tcacaataac 6 60 

tctatactta ctctttgtga cggtaacata actacattaa agggtaatat aaataatact 720 

acagaaattg acggtcaagg tatattaaat ctagcttatg atcttggtag tagtagcata 780 

ataacaggtg atataggtaa tataggctca ttagatacaa taaatgtttt acttggatct 840 

gcaacattta attctacaat attaaaagcg actaatatta atttaaaaca taatacttca 900 

20 acactcaatt tagatgataa cataattgtt attggtaata taaaaggtaa taataacaaa 960 

gatatattaa attttaaagt gcatggtact aatttagata acgaaatgat tattcctgct 1020 

cctcaaaaaa ctcatggtac attaaatttt aaaggaaatg ctacacttaa tgggaatata 10 80 

aataatttaa atatacttaa gtttagtgga ggtcacggta aaactttgaa tttacaaggt 1140 

aatactaaag tagataatct tgtttttgca gatagtgttt tagattcagg tactataagt 1200 

25 gttaacgggt tgttagatac agactgtgtg acatttaaca atagtaatgt taacggcgga 1260 

acattaataa taaatgccaa gaacacaatc agtgcaaaat tattaaatgc tacaaaagca 1320 

aaaatacaaa tcaatgctaa tttaacgatg aatcatccaa gtgctggcjga tataagtgat 1380 

attagaatag cggataatac aatctataca atagatgcaa aaaatgggaa tgtaaatttg 1440 

ctaaataaca acgcaaagat catatttgaa ggagctgatt ctatgttagc tttaatcaat 1500 

30 actggtgtta cagctgatag aacctttacc atatataata atttaaatca atctggcaat 1560 

gatgaatatg gaatagtaaa aatagaagca attaaaaaag taataactat agcaaatcaa 1620 

agtggacctt atactatagg gcaggataat acacatcgtc ttaaggaatt aatagtagag 1680 

ggagcaggtg atatcataat agatgatacg atatttacga agttacttag tataaacagt 1740 

acaggacaaa taacatttaa tcgtacttta gatttaggtg caggtggtaa tattgcattt 180 0 

35 ggaaagcatg gtacattagt agtaaacggt gtcactggtt caattacaac ttctgaaaat 1860 

aatcaaggaa tattaacaat caatagcggg aatattactg gtgttattgg cactaatgaa 1920 

ctcggcttaa aacttgtcaa tattggtgca gatcctgtta cgtgctcagc aaatgtattt 1980 

gcatcggtag ctttaactaa cccaagttct gtgttaattt tagcagatgg tgttacgcta 2040 

actggtgaag taacaacaca taataataca aaaggtgtat tatcacttgg aacaggaagt 2100 

40 aatataacag gtcaaatcgg tactaatagc gcagctcttg agaaaataaa tattggggct 2160 

ggagctagta atattgacag taatatatat gcaggttcta cagtactcac agatcaaaca 222 0 

tcggaattaa ctttaaacaa tgatgtagtc gttaacagta atatcataac tactgccggg 228 0 

aacaatagcg gaaagttaat atttacaggt aatggcggca taacaggaaa cataggagca 23 40 

aatggtgcag ctttacaaga ggttgtattt aacggtacta ctaatatagg tggtacagcg 2400 

45 aactcacaga actttactgt tgcacattct gcagcaaatg tggtgattac aggacttact 2460 

actggtgcat taaaatacaa agatactggt acaattatag ctcatgg^agg attggtagga 252 0 

gacatcgatt ttaataataa agctggtaaa ttcattttag gcgatgg-tgc tatgattgat 258 0 

ggatcagtgt tatgtaatgg aggggttgct ggtacattgg attttatagg tgacggtaat 2640 

gtaactcaaa atataggcgc agataatgca aatagtattt cgactatcaa cattcaaggc 270 0 

50 gataatacaa aaaacgtaac tatagcaaat gatatatttg tagataatat tcattttaca 276 0 

aacggtggga tattacagct tggcggaaat ctcacaacgc ataatattga tttcggagca 282 0 

aatggtggta ctttagaatt taatggtaat aatacatata acttaaafcgc tattattgta 28 8 0 

aacggacaaa acggtatatt aaatgctttc acaaacttaa aggctagcga tgatactatt 294 0 

ggtacggtta aaataattaa tataggacaa ataggaacac cgcaaaactt tactattcaa 300 0 

55 gttaataaca aaaatttaac tctagtaagt agcgtaaata gtagcattaa ttttggtgat 3 060 

gctaattcgc aattaatatt atctgcacca gtagatcaaa ctattaaatt tattaataat 312 0 

ttaaacgaaa ctggaggtgg tattatcact ttagatagta atggtaataa tttaaccata 3180 

agtggtaata atggaataaa gcttggtagt aaaggtaatg agttatctag cttaaatatt 324 0 

aaaggaaaag ttactgtaac taatgattta gatatacaaa atattcatca attaaatata 33 00 

60 aacaatggtg cattatttga tgaccaaagt ctcacatctg ctaaaatcaa aaatataaat 33 60 

attggtacag tagcaggcgg agctacttat actttagatg ctataaatga taattttgat 342 0 

ttaaatacta gtggtatggt atttaaacat caagattcaa tattagaact gaaaaatagt 348 0 

tcaaatacta atgaccacac tataacatta acatctgctt tagatccagg taataatcaa 354 0 

tttggtataa ttaaactcat aacagatact aataaattaa ctatagacaa taatggcaat 3600 

65 gtggcttata cactcggtac agcaaatcat atgttgaagc aattaacttt tgctagtata 3660 

gataatgggg ctatagcttt aaaagtaggg atcaatgttg aaaatgttac cttaaatatt 3720 

aaggatatag agttaaatga ggtgaatgca aatgttctgt ttaacaaaaa cacaacatat 3780 
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accgcaacag 

cttaatgatg 

ttaaatttca 

gcaggagcag 

ggtaacggta 

aagacaggcg 

atcggcgcaa 

aatactctta 

aacgtaactg 

aatcatacac 

gaagtcgcaa 

tcaaatgcca 

actacggcag 

ggcataatag 

gtgactatta 

aacgtaaaac 

ttagcaaatg 

cttaatatag 

aaaaatttag 

ctatttaatt 

acagcaaatt 

aaatttaatg 

gcatctgata 

aatgcaaatt 

attgtattag 

ttactgacta 

agtcaaggat 

cactccgata 

ggtgcaatat 

aataaatttg 

ggtcgagacg 

tgtcgtgata 

tcaagtagcg 

atcttagatt 

catgttaaag 

atgtctccga 

ggacttaatg 

atagcaataa 

agcgatgcaa 

aatctaaaaa 

atagtaacgg 

agtgcaagcg 

gataattcta 

aatgataaaa 

tataattggc 

atcaaaaatt 

attaatactt 

accactataa 

gaaaataata 

actatattag 

gaattacatt 

cttgacggta 

aatttaggcg 

ttatctcttg 

taa 



gtaatataaa 
atatagaaat 
atggatcagg 
gtgatgtatc 
ataataattt 
gtcaggattt 
atgcagcagt 
aaagcggtaa 
ctactgatat 
ctattaatat 
ataatgatgt 
ctcaggcagc 
gaaataatat 
gagatgcaaa 
attctccaca 
tcaatataga 
tacaaattag 
acgcaggtaa 
atataccaga 
acttcactga 
ttaaagatgc 
acaatgcttg 
aatttatgtt 
tagtattatt 
atttagctaa 
ttatcactta 
ctaatgtcga 
tcactaacat 
atactccagt 
tgaaatgggt 
atactggagg 
attgtgatgt 
ataaaaatta 
atactaaaaa 
attttggtaa 
atagagttac 
aaggagtagt 
atatggataa 
atactgtaaa 
gactgaatac 
gcatttgggg 
gttatcaatc 
tagttatagg 
atggtgatag 
ttactaataa 
atgaaaaacg 
tttatagcta 
cgccaatgtt 
ctactttcca 
gtttaaatag 
ggtttataaa 
tagatgaacc 
gtggtatttc 
cgaagaaata 



cggtcatgta 
tgacggtagt 
taaagtgact 
actatctgct 
gacatttgct 
aaatttagta 
tggtgatatt 
tattgttata 
ctcaggcaaa 
cactagtaca 
taccattaca 
taccttaact 
tcatacttta 
taatagatta 
tgtttattca 
aggcggtatt 

tgaggatact 

aactataaat 
tgctctaata 
tattaaagcg 
tgttgtaata 
gttaacacaa 
gcttcagaaa 
agataatgtg 
ttatgaatta 
ttttgatact 
tatgtccgat 
tacctcagat 
accgcaaact 
cgctgatgca 
tcgagacgat 
agggaatatc 
tggcatcact 
taattatgta 
tactactgat 
tgaaacatta 
aggactgaac 
tttcactgct 
tggtctcaac 
taataatcag 
catgtcattt 
taatacaggt 
ggcggcttat 
aaccaaagct 
cttttttgtt 
tataactact 
tgaattacta 
tggtatgcgt 
aaatttatct 
tgtaactcat 
ttatcaatgt 
attgacaaca 
tactaaaaat 
tacagcacat 



gatttccaag 
gttacaagta 
ggtttaatca 
agcggtaatt 
gctaattcac 
ttcataaatg 
attataaacg 
tcggatggtg 
aatgcaaata 
cttggtaata 
gggacattac 
ttaggcgcag 
gaagtaacag 
aaatcaatag 
tctattacta 
acttatgatt 
actattagag 
tttgatagag 
gatttagatg 
gataatttaa 
gatgcacata 
gaaattaaaa 
aacattaaag 
gaagtaaata 
aaatatactg 
gcattacaaa 
ttagataatt 
actaaacacc 
aaagtcatta 
aacggtttgg 
actaggggtc 
agcaataaca 
gatgttgtac 
gcttctggta 
gcaggtaaat 
gatagactta 
ggtatcgagg 
aaagagattg 
aaaacaaaca 
gcaataattg 
tatggtaaaa 
ggtggtataa 
actatggctg 
aaaagtaata 
gaagctatag 
attactgatc 
ggtggctata 
tatgctacat 
ataaagaaaa 
tatttatcac 
aaaaataagt 
attagattta 
aatatgatag 
caaggatcct 



gtaatgcagg 
ccggtaaLgu 
ataacatagt 
attctattac 
acttaacaac 
gtggtagcgt 
caggaagtgt 
ccacgatgca 
acggaacttt 
ataatgctat 
aagcgcaaaa 
catcgcaagt 
attttg-atac 
aattg^cagg 
ctgcaaataa 
taggaagtaa 
gtgatgtgta 
gtgataataa 
tattaccacg 
attttcjcaga 
ttgataatgg 
atgctaacat 
cagctacttt 
ctaatctaaa 
gtaatcjttac 
aaggtgggca 
taataattaa 
aaatagtaaa 
ttgatcjcaag 
tattacttac 
gaggcaatac 
gtagtaatga 
caatttttga 
tagcaaacca 
tattaaatga 
gtaatagaat 
ttgagaattt 
gcaataggtt 
cattgcttaa 
ctgcaggtga 
taaagcaaaa 
taggctttga 
atagtaaagt 
tatatfcctat 
gtgtgfcatgg 
aaatcgcaat 
actatctaat 
ttaaaaataa 
attactatga 
aagatataat 
taccaaatat 
aacctgcaaa 
aatttggtat 
taaagattaa 



tgtaataaat 
aaacggtaca 
aatgctgcaa 
tgaaattcaa 
tgatattaat 
tagcggttct 
aaatttcagt 
agttaataat 
aaaactaaat 
agggaccata 
tattcatttt 
gactaatatt 
tggtaatgac 
caatggcaca 
cgcgcaaggt 
aataaaaagt 
ttctaaatat 
catgaatccc 
ctcgctctca 
tgatacfcgct 
tggcatttta 
catagaaatt 
aatagctgat 
tgttagagat 
acataatgga 
tatattagtt 
aattaaagca 
actcgaaaca 
cgaggaacaa 
tgatactgga 
tgacaatggc 
agcaggtggg 
tccatctcct 
acttattaat 
tttaggtttt 
aaatgttaat 
tttaacagat 
agaagaatta 
taataagatt 
tgaagataat 
ctctaaaaac 
ttataatatt 
caagcacaaa 
ctatgggctt 
tagaaataaa 
aggtaaattt 
atcgcatcgt 
tggttacaaa 
taaatttgaa 
aataaagccc 
tgatgcacgc 
gataacatat 
tagatataac 
agtgaacctg 



<212> Type : DNA 

<211> Length : 7023 

SequenceName : SEQ ID 554 
SequenceDescription : 

Sequence 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
462 0 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7023 



<213> OrganismName : Rickettsia prowazekii strain Madrid E 
<40 0> PreSequenceString : 

atggctcaaa aaccaaattt tctaaaaaaa ataatttccg caggattggt aactgcttcc 
acggctacta tagtagctgg tttctctggt gtagcaatgg gtgctgctat gcaatataat 
aggacaacaa atgcagcagc tacaaccttt gatggtatag gctttgatca agctgctggt 



60 
120 
180 
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gctaatattc ctgtcgctcc aaattcagtt attactgcta atgctaataa tcctattact 240 

tttaatactc caaacggtca tttaaatagt ttatttttgg atactgcaaa tgatttagca 3 00 

gtaacaatta atgaggatac taccttagga tttataacaa atattgctca gcaggctaag 3 60 

ttctttaatt ttactgttgc tgctggtaaa attcttaaca taacagggca gggtattact 42 0 

5 gttcaagaag cttctaatac aataaatgct caaaatgctc ttacaaaagt gcatggtggc 480 

gctgctatta acgctaatga tcttagcggg ctaggatcaa taacctttgc tgctgcgcct 540 

tctgtattag aatttaattt aataaatcct acaactcaag aagctcctct tacacttggt 600 

gctaattcta aaatagttaa tggtggtaat gggacattaa atattactaa tggatttatt 660 

caggtttcag ataacacttt tgctggtatt aagaccatta atatcgatga ttgtcaaggt 720 

10 ttaatgttta attctactcc tgatgccgct aatactttaa atttacaagt aggtggtaat 780 

actattaatt ttaatggaat agacggtact ggtaaattag tattagtcag taagaatggt 840 

gctgctaccg aatttaatgt tacaggaact ttaggtggta atctaaaagg tattattgaa 9 00 

ttgaacactg cagcagtagc tggtaaactt atctctcaag gaggtgctgc taatgcagta 960 

ataggtacag ataatggagc aggtagagct gcaggattta ttgttagtgt tgataatggt 1020 

15 aatgcagcaa caatttctgg acaagtttat gctaaaaaca tggtgataca aagtgctaat 10 80 

gcaggtggac aagtcacttt tgaacacata gttgatgttg gtttaggcgg taccaccaac 1140 

tttaaaactg cagattctaa agttataata acagaaaact caaactttgg ttctactaat 12 0 0 

tttggtaatc ttgacacaca gattgtagtc cctgatacta agattcttaa aggtaacttc 1260 

ataggtgatg taaaaaataa cggtaatact gcaggtgtga ttacttttaa tgctaatggt 132 0 

20 gctttagtaa gtgctagtac tgatccaaat attgcagtaa caaatattaa tgcaattgaa 13 80 

gcagaagggg ccggggttgt agaattatca ggaatacata ttgcagaatt acgtttaggg 1440 

aatggtggct ctatctttaa acttgctgat ggcacagtaa ttaatggtcc agttaaccaa 150 0 

aatgctctta tgaataataa tgctcttgca gctggttcta ttcagfctaga tgggagtgct 1560 

ataattaccg gtgatatagg taacggtggt gttaatgctg cgttacaaca cattacttta 1620 

25 gctaacgatg cttcaaaaat attagcactc gatggcgcaa atattatcgg ggctaatgtt 1680 

ggtggtgcaa ttcattttca agctaacggt ggtactatta aattaacaaa tactcaaaat 1740 

aatattgtag ttaattttga tttagatata actactgata aaacaggtgt tgttgatgca 180 0 

agtagtttaa caaataatca aactttaact attaatggta gtatcggtac tgttgtagcfc 1860 

aatactaaaa cacttgcaca attaaacatc gggtcaagta aaacaatatt aaatgctggc 192 0 

30 gatgtcgcta ttaacgagtt agttatagaa aataatggtt cagtacaact taatcacaat 1980 

acttacttaa taacaaaaac tatcaatgct gcaaaccaag gtcaaataat cgttgccgct 2 040 

gatcctctta atactaatac tactcttgct gatggtacaa atttaggtag tgcagaaaat 2100 

ccactttcta ctattcattt tgccactaaa gctgctaatg ctgactctat attaaatgta 2160 

ggtaaaggag taaatttata tgctaataat attactacta acgatgctaa tgtaggttct 222 0 

35 ttacacttta ggtctggfcgg tacaagtata gtaagtggta cagttggtgg acagcaaggt 22 8 0 

cataagctta ataatttaat attagataat ggtactactg ttaagfctttt aggtgataca 2340 

acatttaatg gtggtactaa aattgaaggt aaatccatct tgcaaatfcag caataattat 2400 

actactgatc atgttgaatc tgctgataat actggtacat tagaatttgt taacactgat 2460 

cctataaccg taacattaaa taaacaaggt gcttattttg gtgttttaaa acaagtaatt 2520 

40 atttctggtc caggtaacat agtatttaat gagataggta atgtaggaat tgtacatggt 2580 

atagcagcta attcaatttc ttttgaaaat gcaagtttag gtacatcttt attcttacct 2640 

agtggtactc cattagatgt tttaacaatt aaaagtaccg taggtaatgg tacagtagat 270 0 

aattttaatg ctcctatfcgt agttgtatca ggtattgata gtatgatcaa taacggtcaa 2760 

atcatcggtg ataaaaagaa tattatagct ctatcgcttg gaagtgataa cagtattact 2 82 0 

45 gttaatgcta atacattata ttcaggtatc agaactacaa aaaataatca aggtactgtg 2880 

acacttagtg gtggtatgcc taataatcct ggtacaattt atggtttagg tttagagaafc 2940 

ggtagtccaa agttaaaaca agtgacattt actacagatt ataacaactt aggtagtatt 3 000 

attgcaaata atgtaacaat taatgattat gtaactctta ctacaggagg tatagcaggg 3 060 

acagattttg acgctaaaat tactcttgga agtgttaacg gtaacgctaa cgtaaggttt 312 0 

50 gttgatagta cattttctga tcctagaagt atgattgttg ctactcaagc taataagggt 3180 

actgtaactt atttaggtaa tgcattagtt agtaatatcg gtagtttaga tactcctgta 324 0 

gcttctgtta gatttacagg taatgatagt ggggcaggat tacaaggcaa tatttattca 33 00 

caaaatatag attttggtac ttataattta actattctaa attctaatcjt cattttaggt 33 60 

ggtggtacta ctgctattaa tggtgaaatc gatcttctga caaataattt aatatttgca 3420 

55 aatggtactt caacatgggg tgataatact tctattagta caacgttaaa tgtatcaagc 34 80 

ggtaatatag gtcaagtagt cattgccgaa gatgctcaag ttaacgcaac aactacagga 3 540 

actacaacca ttaaaataca agataatgct aatgcaaatt tcagtggca'c acaagcttat 3600 

actttaattc aaggtggtgc tagatttaat ggtactttag gagctcctaa ctttgctgta 3660 

acaggaagta atattttcgt aaaatatgaa ctaatacgtg attctaacca ggattatgta 3720 

60 ttaacacgta ctaacgatgt attaaacgta gttacaacag ctgttggaaa tagtgcaatt 3780 

gcaaatgcac ctggtgtaag tcagaacatt tctagatgct tagaatcaac aaatacagca 3840 

gcttataata atatgctttt agctaaagat ccttctgatg ttgcaacatt tgtaggagct 3900 

attgctacag atacaagtgc ggctgtaact acagtaaact taaatgatac acaaaaaact 3960 

caagatctac ttagtaafcag gctaggtaca cttagatatc taagtaatgc tgaaacttct 4020 

65 gatgttgctg gatctgcaac aggtgcagtg tcttcaggtg atgaagcgga agtatcttat 40 80 

ggtgtatggg ctaaaccttt ctataacatt gcagaacaag acaaaaaagg tggtatagct 4140 

ggttataaag caaaaactac tggggttgta gttggtttag atactctcgc tagcgataac 4200 



WO 2005/076010 



232/341 



PCT/IN2005/000037 



ctaatgattg gggcagctat tgggatcact 
aaaggtgata aaactgatat taatggttta 
gttaagaatt tctttgctca aggtaatgca 
agtcagcgtt acttcttcga gtctaatggt 
5 tacgataaca tgacatttgg tggtaattta 
aatgtattag taactccaat ggcaggactt 
aaagaaaccg gtacaacagt tgcaaataag 
gatttaatag taggggctaa agtagctggt 
tatccggaaa ttcattcttt tgtggtgcac 
10 tctatgttag atggacaaac tgctccattt 
tcttataata taggcttaag tgcaaacata 
ggttatgatt ttaattctgc aagtaaatat 
gtaaacttct aa 
. <212> Type ; DNA 
15 <211> Length ': 4932 

SequenceName : SEQ ID 555 
SequenceDescription : 

Sequence 

20 

<213> OrganismName : Porphyromonas gingivalis W83 
<400> PreSequenceString : 

atggcacgaa ttatcttgga ggctcacgat gtatgggaag acggcacagg ctatcaaatg 60 

ctctgggatg cagatcacaa tcagtacggc gcatccattc ccgaagaatc tttttggttfc 12 0 

25 gccaacggaa cgatcccggc cggtctttac gatcctttcg agtataaagt tccggtcaat 180 

gccgatgcat ctttttctcc cacgaatttc gtgcttgatg gaacagcatc agccgatatt 240 

cctgccggca cttatgacta tgtaatcatt aaccccaatc ctggcataat atatatagta. 3 00 

ggagagggtg tctccaaagg taacgattat gtggtagagg ccggtaagac ttatcatttc 3 60 

actgtccaac gacaaggccc cggcgatgct gcgtccgttg tagtgaccgg agaaggtggc 42 0 

30 aatgaattcg ctcccgtaca gaatctccaa tggtctgtat ccgggcagac agtgaccctc 480 

acttggcaag cccccgcatc cgacaaacgg acttatgtgt tgaacgaaag cttcgatacg 540 

caaacgcttc ctaacggctg gacaatgatc gatgctgatg gtgatggtca caattggcta. 600 

tctacaataa acgttfcacaa cactgctact catacaggtg acggtgctat gtttagcaaa. 660 

tcatggacag ctagcagtgg tgcaaaaatt gatttgagtc ctgacaacta tttggtaact 72 0 

35 cctaagttta cggttcctga gaatggtaaa ctttcttatt gggtttcatc tcaagagcct 78 0 

tggactaatg agcattatgg agtgttcttg tccacaaccg gaaacgaggc tgcaaacttfc 840 

acgataaagc tgctggaaga aaccctcgga tccggcaaac ctgctccgat gaacttggtgj 900 

aagagtgaag gagtaaaggc tccggcacct tatcaggaaa gaaccatcga tctctctgcc 960 

tatgccggac aacaggtgta cttggcattc cgtcatttcg gctgtacagg tatattccgfc 1020 

40 ctttatcttg atgacgtggc tgtttctggt gaaggttctt ccaacgacta cacgtacacg 10 8 0 

gtatatcgtg acaatgttgt tatcgcccag aatctcacgg caacgacatt caatcaggaa. 114 0 

aatgtagctc ccggccagta caactactgt gttgaagtta agtacacagc cggcgtatcfc 12 00 

ccgaaggtat gtaaagacgt tacggtagaa ggatccaatg aatttgctcc tgtacagaac 126 0 

ctgaccggta gtgcagtcgg ccagaaagta acgctcaagt gggatgcacc taatggtacc 132 0 

45 ccgaatccga atccgggaac aacaacactt tccgaatcat tcgaaaatgg tattcctgcc 13 8 0 

tcatggaaga cgatcgatgc agacggtgac ggcaacaatt ggacgacgac ccctcctccc 1440 

ggaggctcct cttttgcagg tcacaacagt gcaatctgtg tctcttcggc ttcttatatc 15 00 

aactttgaag gccctcagaa ccctgataac tatctggtta caccggagct ttctcttcct 15 60 

aacggaggaa cgcttacttt ctgggtatgt gcacaagatg ccaattatgc atcagagcac 1620 

50 tatgccgtgt atgcatcttc tacgggtaac gacgcttcca acttcgccaa cgctttgttgj 1680 

gaagaagtgc tgacggccaa gacagttgtt acggcaccgg aagccattcg tggcactcgt 1740 

gttcagggca cctggtatca aaagacggta cagttgcctg cgggtactaa gtatgttgcc 18 0 0 

ttccgtcact tcggctgtac ggacttcttc tggatcaacc tcgatgatgt tgagatcaag 18 60 

gccaacggca agcgcgcaga cttcacggaa acgttcgagt cttctactca tggagaggca. 192 0 

55 ccagcggaat ggactactat cgatgccgat ggcgatggtc agggttggct ctgtctgtcfc 19 80 

tccggacaat tgggatggct gacagctcat ggcggcacca acgtagtagc ctctttctca 2 04 0 

tggaatggaa tggctttgaa tcctgataac tatctcatct caaaggatgt tacaggcgca. 2100 

acgaaggtaa agtactacta tgcagtcaac gacggttttc ccggggatca ctatgcggtg 2160 

atgatctcca agacgggcac gaacgccgga gacttcacgg tcgttttcga agaaacgccfc 222 0 

60 aacggaataa ataagggcgg agcaagattc ggtctttcca cggaagccaa tggcgccaaa. 2280 

cctcaaagtg tatggatcga gcgtacggta gatttgcctg cgggcacgaa gtatgttgct 2340 

ttccgtcact acaattgctc ggatttgaac tacattcttt tggatgatat tcagttcacc 24 00 

atgggtggca gccccacccc gaccgattat acctacacgg tgtatcgtga tggtacgaag 24 60 

atcaaggaag gtttgaccga aacgaccttc gaagaagacg gcgtagctac gggcaatcat 2520 

65 gagtattgcg tggaagtgaa gtacacagcc ggcgtatctc cgaaagagtg tgtaaacgta 2580 

actgttgatc ctgtgcagtt taatcctgta cagaacctga ccggtagtgc agtaggtcag 2640 

aaagtaacgc ttaagtggga tgcacctaat ggtaccccga atccgaatcc cggaacaact 270 0 



aaaactgata taaaacacca 
tcattctctc tatatggttc 
atctttacct taaacaaagt 
aagatgagca agcaaattgc 
atatttggtt atgattataa 
agctacttaa aatcttctaa 
cgcattaata gcaaatttag 
agtactgtga atataactga 
aaagtaaatg gtaaattatc 
atcagtcaac ctgatagaac 
aaatctgatg ctaagatgga 
actgcacatc aaggtacttt 



agattataag 42 60 

ccaacagctt 43 20 

caaaagtaaa 43 8 0 

tgctggtaat 4440 

tgcaatgcca 45 00 

tgaaaattat 4560 

tgatagagtc 4620 

tattgtgata. 4680 

taactctcag 4740 

tgctaaaacg 48 00 

gtatggtatc 4860 

aaaagtacgt 4920 
4932 
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acactttccg aatcattcga aaatggtatt 
ggtgacggca acaattggac gacgacccct 
aacagtgcga tctgtgtctc ttcggcttct 
gataactatc tggttacacc ggagctatct 
5 gtatgtgcac aagatgccaa ttatgcatca 
ggtaacgacg cttccaactt cgccaacgct 
gttgttacgg cacctgaagc cattcgtggc 
acggtacagt tgcctgcggg tactaagtat 
ttcttctgga tcaacctcga tgatgttgag 
10 acggaaacgt tcgagtcttc tactcatgga 
gccgatggcg atggtcaggg ttggctctgt 
gctcatggcg gcaccaacgt agtagcctct 
gataactatc tcatctcaaa ggatgttaca 
.gteaacgacg gttttcccgg ggatcactat 
15 gccggagact tcacggttgt tttcgaagaa 
agattcggtc tttccacgga agccaatggc 
acggtagatt tgcctgcggg cacgaagtat 
ttgaactaca ttcttttgga tgatattcag 
gattatacct acacggtgta tcgtgatggt 

20 accttcgaag aagacggcgt agctacgggc 
acagccggcg tatctccgaa agagtgcgta 
cctgtacaga acctgaccgg tagtgcagtc 
cctaatggta ccccgaatcc gaatccggga 
ggtattcctg cctcatggaa gacgatcgat 

25 acccctcctc ccggaggcac ctcttttgca 
gcttcttata tcaactttga aggccctcag 
ctttctcttc ctaacggagg aacgcttact 
gcatcagagc actatgccgt gtatgcatct 
aacgctttgt tggaagaagt gctgacggcc 

30 cgtggtactc gtgttcaggg cacctggtat 
aagtatgttg ccttccgtca cttcggctgt 
gttgagatca aggccaacgg caagcgcgca 
catggagagg caccagcgga atggactact 
ctctgtctgt cttccggaca attgggatgg 

35 gcctctttct catggaatgg aatggctttg 
gttacaggcg caacgaaggt aaagtactac 
cactatgcgg tgatgatctc caagacgggc 
gaagaaacgc ctaacggaat aaataagggc 
aatggcgcca aacctcaaag tgtatggatc 

40 aagtatgttg ctttccgtca ctacaattgc 
attcagttca ccatgggtgg cagccccacc 
gatggtacga agatcaagga aggtttgacc 
acgggcaatc atgagtattg cgtggaagtg 
tgcgtaaacg taactattaa tccgacacag 

45 caagctccta acagcatgga tgcaatcctt 
gaagttctga acgaagactt cgaaaatggt 
gacggtgacg gcaacaattg gacgacgacc 
cacaacagtg cgatctgtgt ctcttcggct 
cctgataact atctggttac accggagctt 

50 tgggtatgtg cacaagatgc caattatgca 
acgggtaacg acgcttccaa cttcgccaac 
acagttgtta cggcaccgga agccattcgt 
aagacggtac agttgcctgc gggtactaag 
gacttcttct ggatcaacct tgatgatgtt 

55 acctatacga tctatcgtaa taatacacag 
cgagatccgg acttggctac cggtttttac 
ggagaatcag ctatcgaaac tgctacgttg 
cagaagcctt acacgctgac agttgtagga 
gctatgatct acgacatgaa cggtcgtcgt 

60 acggctcagg gcggccacta tgcagtcatg 
aaactcgctg taaagtaa 
<212> Type : DNA 
<211> Length : 6318 

SequenceName : SEQ ID 556 

65 SequenceDe script ion : 



cctgcctcat ggaagacgat cgafcgcagac 2760 

cctcccggag gcacctcttt tgcaggtcac 2820 

tatatcaact ttgaaggccc tcagaaccct 28 80 

cttcctaacg gaggaacgct tactttctgg 2940 

gagcactatg ccgtgtatgc atcttctacg 30 0 0 

ttgttggaag aagtgctgac ggccaagaca 3 0 60 

actcgtgttc agggcacctg gtatcaaaag 3120 

gttgccttcc gtcacttcgg ctgtacggac 3180 

atcaaggcca acggcaagcg cgcagacttc 3240 

gaggcaccag cggaatggac tactatcgat 33 00 

ctgtcttccg gacaattgga ctggctgaca 33 60 

ttctcatgga atggaatggc tttgaatcct 342 0 

ggcgcaacga aggtaaagta ctactatgca 3480 

gcggtgatga tctccaagac gggcacgaac 3540 

acgcctaacg gaataaataa gggcggagca 36 00 

gccaaacctc aaagtgtatg gatcgagcgt 3 660 

gttgctttcc gtcactacaa ttgctcggat 3720 

ttcaccatgg gtggcagccc caccccgacc 3780 

acgaagatca aggaaggttt gaccgaaacg 3840 

aatcatgagt attgcgtgga agtgaagtac 39 00 

aacgtaactg ttgatcctgt gcagttcaat 3960 

ggccagaaag taacgctcaa gtgggatgca 4020 

acaacaacac tttccgaatc attcgaaaat 4080 

gcagacggtg acggcaacaa ttggacgacg 414 0 

ggtcacaaca gtgcgatctg tgtctcttcg 42 00 

aaccctgata actatctggt tacaccggag 4260 

ttctgggtat gtgcacaaga tgccaattat 4320 

tctacgggta acgacgcttc caacttcgcc 43 80 

aagacagttg ttacggcacc ggaagccatt 4440 

caaaagacgg tacagttgcc tgcgggtact 45 00 

acggacttct tctggatcaa cctcgatgat 4560 

gacttcacgg aaacgttcga gtcttctact 4620 

atcgatgccg atggcgatgg tcagggttgg 4680 

ctgacagctc atggcggcac caacgtagta 4740 

aatcctgata actatctcat ctcaaaggat 48 00 
tatgcagtca acgacggttt tcccggggat " 4860 

acgaacgccg gagacttcac ggtcgttttc 492 0 

ggagcaagat tcggtctttc cacggaagcc 4980 

gagcgtacgg tagatttgcc tgcgggcacg 5040 

tcggatttga actacattct tttggatgat 5100 

ccgaccgatt atacctacac ggtgtatcgt 5160 

gaaacgacct tcgaagaaga cggcgtagct 522 0 

aagtacacag ccggcgtatc tccgaaagag 52 80 

ttcaatcctg tacagaacct gaccjgcagaa 534 0 

aaatggaatg caccggcatc taagcgtgcg 5400 

attcctgcct catggaagac gatcgatgca 5460 

cctcctcccg gaggctcctc ttttgcaggt 5520 

tcttatatca actttgaagg tcctcagaac 558 0 
tctcttcctg gcggaggaac gcttactttc . 5640 

tcagagcact atgccgtgta tgcatcttct 5700 

gctttgttgg aagaagtgct gacggccaag 5760 

ggtactcgtg ttcagggcac ctggtatcaa 582 0 

tatgttgcct tccgtcactt cggctgtacg 58 8 0 

gtaatcactt cagggaacgc tccgtcttac 5940 

atagcatcag gcgtaacgga gactacttac 60 00 

acgtacggtg ttaaggttgt ttacccgaac 6060 

aatatcactt cgttggcaga cgtaacggct 612 0 

aagacgatca cggtaacttg ccaaggcgaa 618 0 

ctggcagcgg gtcgcaacac ggttgtttac 6240 

gttgtcgttg acggcaagtc ctacgtagag 63 0 0 

6318 



Sequence 
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<213> OrganisraName : Porphyromonas giiragivalis W83 
<400> PreSequenceString : 

atgaaaacat ctgaaagaat attaagttat ttctfccctct tatgtgctgt attcagtctg 60 

5 ggctcatgcg aaggacttta tgcacaggta actttcccaa attattcgcc tacggcggct 12 0 

tcgtccattg ctgtatgttc tggagaagag acattgatca ttgactttac tgtagtccag 180 

gaggattcga atggtatcaa agttaatgtg aaacttgccg acggtgtcga gtatgtggtc 240 

ggaacggctg tcgtaagtgt tacacagggc aatgcagtga cggtggcgga aaccaatgtt 3 00 

tctaatccga acgaacctgt atttacggta aaatcrggcgg atggaaacaa fcgtggtagag 360 

10 cttggaacca tcgttaagct gacgattaag agga^agctg tctgtaccgc atggagcaat 420 

gccattaatg ctgccgaaac gggttttgtc ttcaaagaca aggtaacggt gactatcggc 480 

gatcatagtg atagcaagga atcaaactcc tattcggtaa actatccgaa cctgacgatc 540 

aaacagcctg cgccgcaagt gaacaagcag attggggaga ccatcgtacg agagttttct 600 

ataaccaatg gttctcagaa cccgacccag acagtttatc tttcgataga-gtatcccgat 660 

15 gaagcctacc tcacaggggt gggggcgatg acgcfctcagg ctaaactggg tgcttccggc 720 

acctatgccg atcttactcc taccgtcact aacggtaagg tccgcatcta tacactttcg 780 

ggttccagtt tggggcctga tcatctcttg accaacggcg agatcatcta tctgaaagaa 840 

acatttaagc tgaaaacttg tgcaccggtt acggtzzctata gggtaggttg gggttgtagc 900 

atagatagcc agtgtgagat aaaaactacc gctgc=:tacaa ttactatggc agctggtgcg 960 

20 gctaatatca cgggatattc agttactggt ccfcgeattatc gttctccaac tttttctctt 102 0 

tgccaaccgt ttgagttgac tattaagttc tcaaattccg gtgctggtgg ctctatgggg 10 80 

gcagcattca atatcaatac tattggtaga aacgactatt atagaccaag agggtttgtt 1140 

ttacacgaat ttattgatgt caaagtaaac ggta^.gccgg taacgaattt caaaaccgat 12 0 0 

ggctcagagc tcgaccttcg ttttgatgga cagtfctacag aagatcctga cggaccgggg 1260 

25 gttggtttgg atgatgttga cggtgacgga ttttsitgatg atcttcctgt cggagctact 1320 

attacgatta ctgtaacggt gcggctaaag tgtgaitcagt ttacggcatg caacaacgct 1380 

ccaaatgatt tgtccgatag gggcttgatt cttaaaacac tatatcagac atcttgtgat 1440 

agaacctcat ggatagatcc caacacgtgg ttcaatcttt ctagtactca tttgtatttg 1500 

tctcgtgagt cggtacaaga tgcctctcac atgc<=:tactg taatagagaa ggatacgcct 1560 

30 ttcgacctga agataatgac ttcctactat tccatz:cctca gctcatataa taatatatgg 1620 

tacgccaatc ccaatacgcg gtatgtggta gaaatz:agtat tcccgcaagg tatgactatg 1680 

cctcccaaat cggatataga atggaccaat ataaaaaatc atccgafcaga cgggtcctta 1740 

gttttcactc caccgattaa cctccctgat gcaaatatca cgacatcagg aaacacaatg 1800 

actatcgttt -cgcccagcca agaaagaggt tttgtaaccc tccatggtgt gaaatacgac 18 60 

35 tgtacaaata atcacgaaat ggttgtggag tataagatta gagaggtatt caactacctt 1920 

cacttccctg attgtctttg cccggtaggt cctatitatgt gtaacacggc aaagcgttat 1980 

gttttgggct gcgatcctcc ctgcggtaga ggtgcggaaa ctfccggtgcc taagatagaa 2040 

cgtgccgaca attcgttggg ctggacggat tatacrgatgc ggacccgtca atctcgtagc 2100 

aatatatcgg cttacgactt ggctaaagcc ctatatatgg acgaagtcaa cattacagcg 2160 

40 acttctatcc agcatggtac tgcttcgtct ttgg^cgccc gttttgtttt ggctacgggt 222 0 

gtcgatcgag tagaaacgct tactcctctt tcggccgata ttaagatctt ccgtgatggg 2280 

gttcagattg tatcggtaga tggatatacg acatfcccgtt caatacgccg aaataataat 2340 

gcagagcagg tgatcgactg ggattttact tcaafcccttc ctgctggtgg attgcttgat 2400 

agagacaaag tggatgttgt tacccgttat cgggtaacat cccagaatgc tcatagagta 2460 

45 gatacgcaag tcggtaggga gtggttcfctc tacaactcta ccgccaatgt gtcaccgata 2520 

tgggatgaag ccaatccgtt aacttgtctc atacfctgtgc ccgagatata catcatgggt 258 0 

acttttgttg taaacggtac cgatccacat gtcatittcac aatgtactcc aacagatctg 2640 

ggacgcgttg ctaaccacta cgcccgtcgt ttcggctctg gtgcatttga atatgccaat 2700 

gaatatcgtc ctggtgtaaa gattagaaat atctatctga aagtaccgaa gtcctacacg 2760 

50 ctgaataggg tggagtatag caatcaccgt aaccatagtt cgttaggtac aaccatgcct 2820 

ttcgaggaaa taaatcatac agatgtgact tcacsigggtg aatataacat ctataagtat 2880 

caacttgcag acaacgaaaa ggcgcacttc aatafctacag taaaaaatgc ctatggagca 2940 

gctcttaaag taaatgtatc tcccacttgt gcgtczrgtctg ctgtagcaac taattatgat 3 0 00 

aaaatttcat actatgtcga ttacattgac tattactatt atgcagcaac gcagccaaca 3060 

55 gtacctaata gccttgacat agtagccgat caatcrggctg gcagcaacgg aatctacagt 3120 

gtttccgccc tcaatgttta caacaggcct atcctttata ctaacaaacc ttctattgcg 3180 

ctcgtcaatc agtcaggtga ggtagagctt gtgggtaaaa cgggagagtg gaagctgcgt 3240 

atcagcaacc catcgagtgc aacggctccc tatgtttggt tggcattgcc tacaacatcg 33 00 

gggctgacca tcgaaaaagt aactgatgcg gcagcjtactg aaatggcgtt tacaacttat 33 60 

60 tctggtggca agatgtatcg tttgtcggaa gctggtgttc cagtaggttc tgcgcttgac 3420 

tataccattc actttaccta ttctggttgc tctcczrtatcg ctttgaaggc gatggggggc 3480 

tggaactgta gtgcatatcc ccttagtttg gatgsigtatg tttgcagttc gcaggtgatc 3540 

gatctcaagc tcaagccact gccagctgcc atggsgctta ctgagatagc tgttccagat 3 600 

cctacagctg ctgctacatt gtgtagtaca ttggsiatata tttacagcat tcaatcgaca 3660 

65 gataatgcga acgtttatag tcctactttc agcafccttcc ccgaagaggg attggtggtg 3720 

acaccgaatc aggtacaggt ggaatatcct gccggttccg gtaattgggc tgcactcaat 3780 

gtggtcaata attccgtcaa tctgttgcag catccz:tgcat tgactaccat aggctatctc 3 840 
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aagggcctga aagaggggga' atccaacgac aatcaacgta aaattttggt gaagttctac 39 00 

ataaagaccg agtgttcgtt cgtatctggc aagaacttcc gtgtaagagc tgacggccgt 3960 

aatgcttgta atcagaatgc caagggatcg ggtcttgcca taagtacgcc tccaattaga 402 0 

ataaatggag ctatagagcc ctacacgact tctgcttcta cgcagcttgt tacgaccaca 4080 

5 acatcacaat cggactgtaa agctcccaaa agagtaaaag tggfcgcaaac ggtcgtaggt 4140 

ggagaaacta cccccaaggc atatttggaa atcacgctgc cgttgggctt taagtatgtg 4200 

acgggttctt atgctccgga caatacgcat ccgggaggag tcaatgcctc acctgccgga 4260 

acggaagaag tcactttgac cgcgaacggt gaagacaaga ttaagataaa tgtcaaggcc 4320 

ggtctgacgt caggtcaatc gtttgcttat acactcgaaa tgaaggaaga cgatgataat 43 80 

10 gtgccggctt gtggcaatca taccatcgaa attgtcaatg ttgaggagat tgaaggtttg 4440 

tggtgtgaag gcgttcagtg tgcagaaact ttggtcgtca cgggtgccaa caagtttgaa 450 0 

tttgagcttg ataagcctta cttggatatt acggttattt cagcagtatc gactttcagt 4560 

ggtggtaagg aaaatcttac aattgagtat aaggtaagca atacatcgac cacccagcct 4620 

ctgaaaccgg gagcggttgt aacgctgttc agcgataagg ataacaatca agtcttctcc 4680 

15 ggcggagatg ttgctgttgc aacacaggag ttggtcgcag aaataactaa taccacacct 4740 

cttacgcaga taatgaaggt aaaaggagtg agctcttccc atacgggcaa tttggttctt 4800 

acgatactgc ccaaagacgg ttgctactgt gagatcaaat cccctatggt cacgttaaac 48 60 

catcttcctt cgaattactg gatcggagga actgtaggta agcctaacga atggaaagag 4920 

ccgaacaact ggaccaatga ccaagttccc gatgcggcag aggatgttga attcgccact 4980 

20 gaggtgaata acccgactga tccgaataat ccgaagtcgg gtcctgcgaa ggagaacctg 5040 

catctggacg atatacacca gaatggcaca gccggtcgcg ttatcggcaa cctgatcaat 510 0 

gactctgaca aagatctggt aatcacaacg ggcaatcaat tgacgatcaa cggcgtggtt 5160 

gaggataaca atccgaatgt cggtacgatc gtcgtgaagt cgtcgaaaga caatcctacg 5220 

gggacgttgc ttttcgccaa tccgggcaat aatcaaaatg taggggggac tgtcgagttt 5280 

25 tacaatcagg gatatgattg tgccgattgt ggtatgtatc gcaggagctg gcagtatttc 5340 

ggtatccctg tcaatgaatc agattttcca tatgatcatg ttgatggaaa cgcgaccgtc 5400 

aaccaatggg ttgagccttt caatggcgat aagtggcggc ctgcacctta tgcacctgat 5460 

acaaagcttc agaaattcaa gggctatcag atcacgaatg acgtgcaggc acagcctacg 5520 

ggagtttaca gcttcaaggg tacgctttgt gtgtgcgatg ccttcctgaa cctgacacgc 558 0 

30 acgtccggtg tcaactactc gggcgccaac ttgatcggca actcatacac tggagctatc 5640 

gacatcaagc aaggtattgt cttcccgccg gaagtcgagc agacggtgta tctgttcaac 57 0 0 

acgggaacac gcgaccagtg gcgtaagctt aatggaagca cggtttcagg ctatcgagcc 5760 

ggtcagtacc tctctgtacc taagaataca gcgggtcagg acaatcttcc ggatcgtatt. 5820 

ccatcgatgc attccttctt ggtgaagatg cagaacggag cctcttgtac gttgcagatc 5880 

35 ttgtacgata agctgctcaa gaacacgact gtaaacaacg gtaatggtac gcagatcaca 5940 

tggcgatccg gcaactccgg atcggcgaat atgccgtcac ttgtgatgga tgttcttggt 60 00 

aatgagtcgg ccgaccgttt gtggatcttt accgatgggg gtctttcgtt cggattcgac 6060 

aacggctggg atggtcgcaa gctgactgaa aaaggtttgt cacaacttta tgcgatgtct 612 0 

gacatcggta atgataaatt ccaggttgca ggggttccgg agttgaataa cctgctgatc 6180 

40 ggcttcgatg cggataagga tggtcaatac acgttggagt ttgctctttc ggatcatttt 6240 

gcgaaaggcg gagttttcct tgaggatctt agccgtggag ttacacggcg agtagtcgac 63 00 

ggcggttcgt attcattcga tgccaagcga ggagactccg gggctcgttt ccgtctctct 6360 

tatgacgaag agtgggttga atcggcagag gtttctgttt tggttggtac ggccgggaag 6420 

cgaatcgtta tcacgaataa cagtgagcat gcctgtcagg ccaatgttta tacaactgac 64 80 

45 ggaaaacttc tgattcgatt ggacgtaaaa cccggaagta agtctatgac ggaaccattg 6540 

gtcgatggag tctacgttgt cagtctgcaa agtcctgcca cgagtagcaa tgtaaggaaa 6600 

gttgtagtca actaa 6615 
<212> Type : DNA 
<211> Length : 6615 

50 SeguenceName : SEQ ID 557 

SequenceDe script ion : 

Sequence 



55 <213> OrganisniName : Porphyromonas gingivalis W83 

<400> PreSequenceString : 

atgaacaaat tttacaaatc acttttgcag 

actgcactga ccgcttctgc acagatttcg 

agatccgccg gaacgcattc attcgacgat 
60 aatccggaag acctgatcgc acagagccgt 

cggataggac aagtaatacc ggtggatgtg 

tccatcggag acgtagatgt atatcgcctg 

acgctttatt acgatgcatt caatattccg 

gaccatgaaa ttgtgttggg agcatatacg 
65 gccacagagc cggtaccggg gagtgagctt 

actttgcctg acatcaagat ctccggtgcg 

cccgtaacgg ataaccatta cgggatcggt 



tcaggactgg ctgccttcgt 
ttcggagggg aacccttgag 
gcaatgacta tccgccttac 
tggcaatcgc aaagagatgg 
gactttgcat ccaaggcttc 
caattcaagt tggaaggagc 
gagggcggac gcctctatat 
aacgccactc atcgccgcaa 
attatggatt atgaagtgtc 
ggttatatat tcgacaaagt 
gaggacgatt ccgattcgga 



gtcgatggca 60 

tttctcttca 120 

tccggatttc 180 

ccggcccgtc 240 

gcacatctct 3 00 

caaagccatt 360 

ctataccccc 420 

cggagctttt 480 

tcgcggaggg 540 

cggcggacgc 600 

ttgcgagatc 660 
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aacatcaatt gtcctgaagg tgcagactgg caggcagaga agaacggtgt ggtgcaaatg 72 0 

atcatggtaa aaggacagta tatctcaatg tgctcaggca acctgctcaa taatacgaaa 780 

ggagacttta ctccgctgat catttctgcc ggacactgtg cttccataac aaccaatttc 840 

ggtgtaacgc aatccgagtt ggataagtgg atcttcactt tccactatga aaaaagagga 900 

5 tgcagcaatg gtacattggc catcttccgt ggcaacagta tcatcggagc ttccatgaag 960 

gctttcctcc cgatcaaagg taaatccgat ggtctcttgc tgcaactcaa cgatgaagtc 1020 

cctctgcgct atcgtgtcta ttacaatgga tgggacagta cgcccgatat tccctcgagc 10 8 0 

ggtgccggta ttcatcatcc ggccggagat gccatgaaga tttccatcct aaagaagact 1140 

ccggctctga atacatggat ctcctccagt ggttccggag ggactgacga tcacttctat 120 0 

10 ttcaaatacg atcaaggtgg tacggaagga ggatcgtccg gttcttctct cttcaatcag 1260 

aataagcacg tggtcggcac actgaccgga ggtgccggca attgtggcgg gacggagttc 1320 

tacggcagac tgaacagtca ttggaacgag tatgcatccg atggcaatac gagccgcatg 13 80 

gacatctatc tggatcccca aaacaatggc cagacgacca tcctcaacgg aacgtatcgt 144 0 

gacggttata agcctttgcc ctctgtgccc cggctattgt tgcagtctac aggcgatcag 1500 

1?> gtcgaattga attggacggc tgttcctgcc gatcaatatc catcatctta tcaggtcgaa 1560 

taccacatat tccgaaatgg aaaggaaata gctacgacaa aggagttgtc ctattcggat 1620 

gccatcgacg aaagtattat cggtagcggt atcattcgat acgaagtaag cgcacgcttc 168 0 

atttatccct cgccgttgga tggagtggaa tcttataagg atacggacaa gacttctgcc 1740 

gaccttgcca taggagacat tcagaccaag ctgaagccgg acgtaacacc tctccccgga 180 0 

20 ggaggagtat cattaagctg gaaagttcct ttcttaagcc agttggtttc ccgattcgga 1860 

gaaagcccca atcctgtgtt caaaaccttt gaagtgccct atgtttctgc cgcagccgca 1920 

caaaccccca atcctcccgt tggcgtagtc attgcagaca agtttatggc cggtacatat 198 0 

cccgaaaagg ctgctatcgc tgccgtttat gtaatgccat ccgctccgga ctctactttc 204 0 

cacctcttcc tcaagagcaa cacaaacaga agattgcaga aggtgacaac tccctccgat 210 0 

25 tggcaggccg gaacatggtt gaggatcaat ttggataagc cgttcccggt gaataatgac 2160 

catatgcttt ttgccggtat cagaatgcct aataagtaca agctcaatcg tgctatccgt 2220 

tatgtaagaa atccggataa ccttttctcc attaccggta agaagatttc atataacaac 22 80 

ggagtctctt tcgaaggcta cggaataccc tcgctcttgg gctatatggc tatcaaatat 234 0 

ctggtggtaa ataccgatgc tccgaagatc gatatgtcgc ttgtacagga gccttatgct 240 0 

30 aagggaacga atgtggctcc attccccgaa ttggtcggca tatatgtcta taagaacgga 2460 

acatttatcg gcacacagga tccatccgtc acaacttatt cggtttcaga cggaacagag 2520 

agcgatgaat acgaaataaa actggtatat aagggatcgg gcatttcgaa tggcgttgct 2580 

cagattgaga ataacaatgc tgtcgttgca tatccgtctg ttgtaacaga tcgtttcagc 2640 
attaagaacg ctcatatggt tcacgctgcc gccctctact cattggatgg caagcaggtt .. 27 00 

35 cgttcttgga acaacctccg caatggcgtg acattcagtg ttcaaggact tacggccggt 2760 

acttatatgc tcgttatgca gacggcaaac ggccctgtga gccaaaagat cgtgaagcag 2820 

tag " " ' 2823 

<212> Type : DNA 
<211> Length. : 2823 

40 SeguenceName : SEQ ID 558 

SequenceDescription : 



Sequence 

45 <213 > Organi sraName : Porphyromonas gingivalis W83 
<4 00> PreSequenceString : 

atgaaaaact tgaacaagtt tgtttcgatt gctctttgct cttccttatt aggaggaatg 60 

gcatttgcgc agcagacaga gttgggacgc aatccgaatg tgagattgct cgaatccact 120 

cagcaatcgg tgacaaaggt tcagttccgt atggacaacc tcaagttcac cgaagttcaa 180 

50 acccctaagg gaatggcaca agtgccgacc tatacagaag gggttaatct ttctgaaaaa 240 

gggatgccta cgcttcccat tctatcacgc tctttggcgg tttcagacac tcgtgagatg .3 00 

aaggtagagg ttgtttcctc aaagttcatc gaaaagaaaa atgtcctgat tgcaccctcc 3 60 

aagggcatga ttatgcgtaa cgaagatccg aaaaagatcc cttacgttta tggaaagagc 420 

tactcgcaaa acaaattctt cccgggagag atcgccacgc ttgatgatcc ttttatcctt 4 80 

55 cgtgatgtgc gtggacaggt tgtaaacttt gcgcctttgc agtataaccc tgtgacaaag 540 

acgttgcgca tctatacgga aatcactgtg gcagtgagcg aaacttcgga gcaaggcaaa 600 

aatattctga acaagaaagg tacatttgcc ggctttgaag acacatacaa gcgcatgttc 660 

atgaactacg agccagggcg ttacacaccg gtagaggaaa aacaaaatgg tcgtatgatc 720 

gtcatcgtag ccaaaaagta tgagggagat attaaagatt tcgttgattg gaaaaaccaa 7 80 

60 cgcggtctcc gtaccgaggt gaaagtggca gaagatattg cttctcccgt tacagctaat 840 

gctattcagc aattcgttaa gcaagaatac gagaaagaag gtaatgattt gacctatgtt 900 
cttttgattg gcgatcacaa agatattcct gccaaaatta ctccggggat caaatccgac 960 

caggtatatg gacaaatagt aggtaatgac cactacaacg aagtcttcat cggtcgtttc 1020 

tcatgtgaga gcaaagagga tctgaagaca caaatcgatc ggactattca ctatgagcgc 1080 

65 aatataacca cggaagacaa atggctcggt caggctcttt gtattgcttc ggctgaagga 1140 

ggcccatccg cagacaatgg tgaaagtgat atccagcatg agaatgtaat cgccaatctg 1200 

cttacccagt atggttatac caagattatc aaatgttatg atccgggagt aactcctaaa 1260 
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aacattattg atgctttcaa cggaggaatc 
gaaacagctt ggggtacgtc tcacttcggc 
aaccagctac cgtttatttt cgacgtagct 
ccttgtttcg cagaagcatt gatgcgtgca 
5 gctatcatag cgtctacgat caaccagtct 
atgaacgaaa ttctgtgcga aaaacacccg 
accatgaacg gtatgtttgc tatggtggaa 
gacacatgga ctgtattcgg cgacccctcg 
atgcaggtta cggctccggc tcagattaat 

10 gattataatg gtgctattgc taccatttca 
gtcgaaaatg gaacagctac aatcaatctg 
cttacagtag ttggttacaa caaagagacg 
cctaacccct accagcctgt ttccaacttg 
. Gtcaagtggg atgcaccgag cacgaaaacc 

15 gatggcatac "gagaactggt tcttctgtca 
ggtcaggccg agattgttct tgaagctcac 
attcttttgg atgcagacca tgatcaatat 
ctttggccga actgtagtgt cccggccaat 
gaaaatgcag atccttcttg ttcccctacc 

20 aatataccgg ccggaactta tgactttgca 
tggattgccg gacaaggacc gacgaaagaa 
taccatttcc ttatgaagaa gatgggtagc 
ggtggtggaa gcgattacac ctatactgtc 
ctgacggcta cgacattcga agaagacggt 

25 gaagttaagt acacagccgg cgtatctccg 
tccaatgaat ttgctcctgt acagaacctg 
cttaagtggg atgcacctaa tggtaccccg 
aatcccggaa caactacact ttccgaatca 
acgatcgatg cagacggtga cgggcatggc 

30 ggctacaata gcaatggttg tgtatattca 
cttacccctg acaactatct gataacaccg 
actttctggg tatgcgcaca ggatgctaat 
tcttcgaccg gtaacgatgc atccaacttc 
gcaaaaggtg ttcgctcgcc ggaagctatt 

35 aagacggtag accttcccgc aggtacgaaa 
gatatgttct acatcgacct tgatgaggtt 
ttcacggaaa cgttcgagtc ttctactcat 
gatgccgatg gcgatggtca gggttggctc 
acagctcatg gcggcaccaa cgtagtaagc 

40 cctgataact atctcatctc aaaggatgtt 
gcagtcaacg acggttttcc cggggatcac 
aacgccggag acttcacggt tgttttcgaa 
gcaagattcg gtctttccac ggaagccgat 
cgtacggtag atttgcctgc gggcacgaag 

45 gatttgaact acattctttt ggatgatatt 
accgattata cctacacggt gtatcgtgat 
acgaccttcg aagaagacgg cgtagctacg 
tacacagccg gcgtatctcc gaagaaatgt 
aatcctgtaa agaacctgaa ggcacaaccg 

50 gccccgagcg caaagaagac agaaggttct 
ttcgttacga tcgaacctgc aaacgatgta 
gcagacaacg tatggggaga caatacgggt 
acattcggaa gtgtcattcc ggcaaccggt 
ctttacagtg cgaacttcga gtatttgatc 

55 cagaatatta tcgttacagg acagggtgaa 
tgcattacga acccggaacc tgcatccgga 
cagcctgcac gttatgacga tttcacattc 
cgtcgcgccg gaatgggaga tggaactgat 
tatacctata cagtctatcg tgacggcacg 

60 taccgcgatg caggaatgag tgcacaatct 
gccggcgtat ctccgaaggt ttgtgtggat 
gctcagaagc cttacacgct gacagttgta 
gaagctatga tctacgacat gaacggtcgt 
tacacggctc agggcggcta ctatgcagtt 

65 gagaaactcg ctatcaagta a 
<212> Type : DNA 
<211> Length : 5121 



tcgttggcca actatacggg ccacggtagc 1320 

accactcatg tgaagcagct taccaacagc 13 80 

tgtgtgaatg gcgatttcct attcagcatg 1440 

caaaaagatg gtaagccgac aggtactgtt 1500 

tgggcttctc ctatgcgcgg gcaggatgag 1560 

aacaacatca agcgtacttt cggtggtgtc 1620 

aagtataaaa aggatggtga gaagatgctc 1680 

ctgctcgttc gtacacttgt cccgaccaaa 1740 

ttgacggatg cttcagtcaa cgtatcttgc 18 00 

gccaatggaa agatgttcgg ttctgcagtt 18 60 

acaggtctga caaatgaaag cacgcttacc 1920 

gttattaaga ccatcaacac taatggtgag 1980 

actgctacaa cgcagggtca gaaagtaacg 2040 

aatgcaacca ctaataccgc .tcgcagcgtg 2100 

gtcagcgatg cccccgaact tcttcgcagc 2160 

gatgtttgga atgatggatc cggttatcag 2220 

ggacaggtta tacccagtga tacccatact 22 80 

ctgttcgctc cgttcgaata tacggttccg 2340 

aatatgataa tggatggtac tgcatccgtt 24 00 

attgctgctc ctcaagcaaa tgcaaagatt 2460 

gatgattatg tatttgaagc cggtaaaaaa 252 0 

ggtgatggaa ctgaattgac tataagcgaa 2580 

tatcgtgacg gcacgaagat caaggaaggt 2640 

gtagctgcag gcaatcatga gtattgcgtg 2700 

aaggtatgta aagacgttac ggtagaagga 2760 

accggtagtg cagtcggcca gaaagtaacg 2820 

aatccaaatc caaatccgaa tccaaatccg 28 80 

ttcgaaaatg gtattcctgc ctcatggaag 2940 

tggaagcctg gaaatgctcc cggaatcgct 3000 

gagtcattcg gtcttggtgg tataggagtt 3 060 

gcattggatt tgcctaacgg aggtaagttg 3120 

tatgcatccg agcactatgc ggtgtatgca 3180 

acgaatgctt tgttggaaga gacgattacg 3240 

cgtggtcgta tacagggtac ttggcgccag 33 00 

tatgttgctt tccgtcactt ccaaagcacg 3360 

gagatcaagg ccaatggcaa gcgcgcagac 342 0 

ggagaggcac cagcggaatg gactactatc 3480 

tgtctgtctt ccggacaatt ggactggctg 3540 

tctttctcat ggaatggaat ggctttgaat 3600 

acaggcgcaa cgaaggtaaa gtactactat 3660 

tatgcggtga tgatctccaa gacgggcacg 3720 

gaaacgccta acggaataaa taagggcgga 3780 

ggcgccaaac ctcaaagtgt atggatcgag 3840 

tatgttgctt tccgtcacta caattgctcg 3900 

cagttcacca tgggtggcag ccccaccccg 3960 

ggtacgaaga tcaaggaagg tttgaccgaa 4020 

ggcaatcatg agtattgcgt ggaagtgaag 408 0 

gtaaacgtaa ctgttaattc gacacagttc 4140 

gatggcggcg acgtggttct caagtgggaa 42 00 

cgtgaagtaa aacggatcgg agacggtctt 4260 

cgtgccaacg aagccaaggt tgtgctcgca 4320 

taccagttct tgttggatgc cgatcacaat 438 0 

cctctcttta ccggaacagc ttcttccgat 4440 

ccggccaatg ccgatcctgt tgttactaca 4500 

gttgtaatcc ccggtggtgt ttacgactat 4560 

aagatgtgga tcgcaggaga tggaggcaac 462 0 

gaagcaggca agaagtacac cttcacgatg 4680 

atggaagtcg aagacgattc acctgcaagc 4740 

aagatcaagg aaggtctgac cgaaacgacc 4800 

catgagtatt gcgtggaagt taagtacaca 4860 

tatattcctg acggagtggc agacgtaacg 4920 

ggaaagacga tcacggtaac ttgccaaggc 4980 

cgtctggcag ccggtcgcaa cacggttgtt 5040 

atggttgtcg ttgacggcaa gtcttacgta 5100 

5121 
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SequenceName : SEQ ID 559 
SequenceDescription : 

Sequence 
5 

<213> OrganisraName : Porphyromonas gingivalis W83 

<4 00> PreSequenceString : 

atgaaaagaa aaccgctatt ctcagccctt 

cacccggcct cagcacagaa agttcctgca 
10 ctaagtgaag ccgatgtgga gtgtacaatc 

gacatttggg cagacctcaa cggaaacggc 

ggtgagtttc gtgatgttga gttcagacaa 

aaattcttgt ttagaggttc ttctgcaggg 

aattgtaccg gcctgactgc tttcgactgc 
15 tccaaagcaa acggtctgac ttttgtaaac 

ctgcccgcaa afcgcggacat tgagacgctg 

aacctatcga cctataccaa gctgaaagag 

ttggatctct ccgccaatac gctcctcgaa 

acgataaacc tgtctgccaa tacgaacttg 
20 accggactcg atgtcgcagc caacaaagag 

ctgaccgccc tcaatctctc ggccaatacc 

gagctgacaa atatcgatct ctccgacaac 

aataagctga cgaagttaga tgtatctgcc 

aacaaccagc tgactgctct ggatctgtca 
25 tactccaacc ggatcaaaga taccgccatg 

actgaaggag aaggcaggtt cgttccttac 

gtgtgtacaa ccgaacacgt ggaaatggcc 

tggggagagc ctttccccgg aataacggct 

tccgtatatg ctcaagatgg catcctctac 
30 caggtatata ccgtgggagg aagcatgatg 

gaaatacagc tcccgagagg tgcagcctat 

accgcgatgc cgtaa 

<212> Type : DNA 

<211> Length : 1455 
35 SequenceName : SEQ XD 560 

SequenceDescription : 

Sequence 



40 <213> OrganismName r Shigella flexneri 2a str. 2457T 
<40 0> PreSequenceString r 

atgaaacgag ctattaccct gtttgctgta ctgctgatgg gctggtcggt aaatgcctgg 60 

tcattcgcct gtaaaaccgc caatggtacc gctatcccta ttggcggtgg cagcgctaat 120 

gtttatgtaa accttgcgcc tgtcgtgaat gtggggcaaa acctggtcgt agatctttcg 180 

45 acgcaaatct tttgccataa cgattatccg gaaaccatta cagactatgt cacactgcaa 240 

cgaggctcgg cttacggcgg cgtgttatct aatttttccg ggaccgtaaa atatagtggc 300 

agtagctatc catttccgac caccagcgaa acgccgcggg ttgtttataa ttcgagaacg 360 

gataagccgt ggccggtggc gctttatttg acgcctgtga gcagtgcggg cggggtggct 420 

attaaagctg gttcattaat tgccgtgctt attttgcgac agaccaacaa ctataacagc 480 

50 gatgattttc agtttgtgtg gaatatttac gccaataatg atgtggtggt gcccactggc 540 

ggttgtgatg tttctgctcg tgatgtcacc gttactctgc cggactaccc tggttcagtg 600 

ccaattcctc ttaccgttta ttgtgcgaaa agccaaaacc tggggtatta cctctccggc 660 

acaaccgcag atgcgggcaa ctcgattttc accaataccg cgtcgttttc accagcgcag 720 

ggcgtcggcg tacagttgac gcgcaacggt acgattattc cagcgaataa cacggtatcg 780 

55 ttaggagcag taggaacttc ggcggtaagt ctgggattaa cggcaaatta cgcacgtacc 840 

ggagggcagg tgactgcagg gaatgtgcaa tcgattattg gcgtgacttt tgtttatcaa 900 

taa 903 
<212> Type : DNA 
<211> Length : 903 

60 SequenceName : SEQ ID 561 

SequenceDescription : 

Sequence 



gtaatccttt ccggcttctt cggatcggtt 60 

cccgtcgatg gcgagcgcat tatcatggag 12 0 

aaaatagaag ccgaggatgg ctatgccaac 180 

aagtacgatt cgggggagag gctcgattca 240 

acaaaggcca tcgtctatgg caaaatggcc 3 00 

gactatggtg ctacctttat agatattagc 360 

tttgccaatc tgctgacaga actcgatctg 420 

tgcggcaaaa accagctgac caagcttgac 480 

aactgctcca aaaacaagat aacgagtctc 54 0 

ctttatgtgg gcgacaacgg gctgacagcc 60 0 

gagctggtgt attctaacaa cgaggtgact 660 

aaaagcctgt attgcataaa caataagatg 72 0 

ctgaaaatac tccactgcaa caacaatcag 780 

aagctgacga ctctaagctt cttcaacaac 840 

acggctttgg agtggctttt ctgcaacggc 90 0 

aacgccaatc tgatagcact gcaatgcagc 960 

aaaacgccga aactgacaac gttgaattgc 1020 

cgtgcattga tcgaaagcct gcctacgatc 10 8 0 

aacgacgatg aaggaggaga agaggagaac 1140 

aaggccaaga attggaaggt acttacctcg 12 00 

ttgatttcca tcgaaggtga gagcgaatat 12 60 

ctctccggta tggagcaggg cttgcccgtt 13 2 0 

tactcatctg tcgcttccgg atcagccatg 13 80 

gtagtacgta tcggcagcca tgcgatcaaa 1440 

1455 



65 <213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgggcatca aacaacacaa tgggaatacc aaagccgatc gtctcgctga attaaaaatc 



60 
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cgttcgccct caattcaact gataaaattt ggcgctattg gtttg^tatgc aattatcttt 120 

tcccccctgc tgatagctgc tgatacagga agtcaatatg gcaccaatat tactattaat 180 

gatggtgaca gaattacagg agataccgcc gatccatcag gaaacctcta tggtgtaatg 240 

accccagcag gaaacacgcc tggcaatatc aacctgggta atgatgtcac cgtcaatgtc 3 00 

aacgacgcct ctggatatgc aaaaggaatc attattcagg gcaaaaacag ctccctgaca 3 60 

gctaaccgac tcacagtaga tgttgttggt caaacctctg ccatcggcat taacttaatt 42 0 

ggtgactata cccatgctga cttaggcaca ggcagcacca ttaagagtaa cgatgacggc 480 

atcattattg ggcatagctc aacactaaca gccactcaat tcaccattga aaactcgaac 540 

ggtataggcc taaccatcaa tgactatggc accagtgtcg atctt^gaag cggaagtaaa 600 

atcaagaccg atggaagtac aggtgtttat atcggtggtc tcaacggcaa taacgccaat 660 

ggtgctgcgc gttttacggc gacagacctg acaatcgatg ttcag^gcta cagcgccatg 720 

gggataaacg tacagaaaaa ctctgttgtc gatctcggaa caaacagtac cattaaaacc 780 

aatggcgata atgctcacgg cctctggagc tttggccagg ttagc^cgaa tgcactcact 840 
gtfcgatgtaa .ctggagccgc ggccaatggc gtcgaagttc gtggt^gtac .aaccactatc f * ,900 

ggtgcagata gccatatttc ttccgcgcag ggcggtggcc tcgtcaccag tagttcagac 960 

gcgacaatca atttttctgg cacggcagcg caacgaaaca gcatcttttc cggcggttct 1020 

tatggtgcct cggcccagac ggcaacggct gttatcaaca tgcaaaatac cgatattacg 1080 

gttgatcgta atggcagtct ggcgctgggt ttgtgggcgc tcagc^gcgg tagaataacc 1140 

ggagacagtt tggctatcac cggcgcggca ggagccagag ggatt-fcatgc catgaccaac 12 0 0 

agccagatcg acctcacgag cgatctggtc attgatatga gtacacccga ccagatggcc 1260 

atcgcaacgc aacatgacga tggttatgcc gccagccgca tcaacgcctc gggtcgtatg 1320 

cttatcaacg gtagcgttct ttccaaaggt gggctaatca atctggatat gcaccctggg 1380 

tcggtttgga caggttcctc cctcagcgat aatgtcaatg gcgggaaact ggacgttgca 1440 

atgaataaca gcgtctggaa cgtaacaagt aattctaatc tcgacacgct ggcgctgagc 1500 

cattcaactg tcgattttgc cagccacggg tcaactgccg gcacatttac cacattaaac 1560 

gtagagaacc tgagcggtaa cagtaccttt attatgcgtg ctgatgttgt tggcgagggt 1620 

aatggcgtta ataatagagg ggatttattg aatatcagcg ggagtiagtgc tggtaatcac 168 0 

gtattggcta tccgcaacca gggcagcgag gccacaacgg gaaat^aagt tctgacagtg 1740 

gtaaaaacca ctgacggcgc ggcctcgttc agcgcgtctt ctcagcgttga gttgggggga 180 0 

tatctgtacg atgtgcgtaa aaatggcact aactgggagc tttacgcttc cgggacagtt 1860 

ccggaaccga ctcctaatcc tgaacccaca ccagctcccg ctcagcctcc catagtcaac 1920 

cccgatccta cgcctgaacc cgctcccacg cctaaaccca ccacgaccgc agatgctggc 1980 

ggcaattatc tcaatgtcgg ttacttattg aactatgttg aaaaccgtac gctgatgcaa 2 040, 

cggatgggtg acttgcgaaa tcagagtaaa gacggtaata tctgg-fctgcg cagttatggg 210 0- 

ggaagcctgg actcctttgc cagtggcaaa ctgagcggct ttgaca.fcggg ttacagcggt 2160 

atccagtttg gtggggataa acgtctctct gatgtaatgc cgttg-tatgt cggtctgtat 2220 

attgactcaa cacatgcatc gccggactat agcggaggcg acggtaccgc acgttcagac 2280 

tacatgggaa tgtacgccag ttacatggca caaaacggtt tttacagcga tctcgttata 2340 

aaagcatcgc gccagaaaaa tagtttccac gtactggaca gtcagaacaa cggcgttaac 2400 

gccaacggca ctgcgaatgg aatgagcatc tccctggaag ccgggcagag gttcaacctg 2460 

tcccctactg gttatgggtt ctatatagag ccgcaaaccc agcttacata cagccaccag 252 0 

aatgagatgg ctatgaaggc gagtaatggc ctcaatatac atctgaatca ctacgaatcg 258 0 

ctgctggggc gtgccagcat gatactgggg tatgacatca ccgca,ggcaa cagccagctg 2640 

aatgtctatg tgaagactgg cgctatccgc gagttttcag gggat accga atatctgttg 2700 

aacgactccc gggagaagta cagtttcaaa ggtaatggct ggaat«a.acgg cgfcgggagtc 2760 

agtgcacagt ataacaaaca gcacacattc tatctcgaag cggattacac gcagggtaac 2820 

ctctttgatc agaagcaagt caacggagga tatcgcttca gcttt-taa 2868 
<212> Type : DNA 
<211> Length : 2868 

SequenceName : SEQ ID 562 

SequenceDescription : 



Sequence 



<213> OrganismName : Shigella flexneri 2a str. 24 57T 
<400> PreSequenceString : 

atgagtaagt ttgttaaaac agctattgct gcggcaatgg tgatg-cggcgt gttcacctct 60 

acggcaacaa tcgctgcggg taacaacggt actgcacgtt tttacrggcac cattgaagac 12 0 

tcagtgtgtt ctatcgttcc ggacgatcac aaactggaag ttgafc atggg tgatatcggt 18 0 

gccgaaaaac tgaaaaataa cggcaccacc acgccgaaaa gtfctccagat tcgtctgcaa 240 

gattgcgtat ttgatactca ggaaacaatg accactacct ttacfcggtac cgtttcttct 300 

gcaaatagcg gcaattatta caccattttc aataccgata ccggfccjcggc atttaacaat 360 

gttagcctgg cgatcggtga ctctctgggt acctcttaca aaagcggcat gggtattgac 420 

cagaaaatag tgaaagacac ttctaccaac aaaggcaaag cgaag-cagac actgaacttt 480 

aacgcctggc tggtcggcgc agctgatgcg ccagatctgg gtaattttga agcaaatacc 540 

accttccaga ttacttacct gtaa 564 
<212> Type : DNA 
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<211> Length : 564 

SequenceName : SEQ ID 563 
SequenceDescription : 

5 Sequence 

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaaatta aaactctggc aatcgttgtt ctgtcggctc tgtccctcag ttccgcagcg 60 

10 gctctggccg atactacgac ggtaaatggt gggaccattc actttaaagg ggaagttgtt 120 

aacgccgctt gcgcagttga tgcaggctct gttgatcaaa ccgttcagtt gggacaggtt 180 

cgtaccgcta gcctgaagca ggctggagca accagctctg ccgttggttt taacattcag 24G 

ctgaatgatt gcgataccac tgttgccaca aaagccgctg ttgccttctt aggtacggca 3 00 

attgatgcta cgcgtactga, tgtactggct ctgcagagtt ctgctgcagg tagtgcaaca_. 3 60 

15 aacgttggcg tgcagatcct agacagaaca ggcaatgctc tgacgctgga cggtgcgaca 420 

tttagtgcac aaacaaccct gaataacggt accaacacca ttccgttcca ggcgcgttat 480 

tatgcaatcg gcgaggcaac cccgggtgca gctaatgcgg atgcaaccfct caaggttcag 540 

tatcaataa 549 
<212> Type : DNA 
20 <211> Length : 549 

SequenceName : SEQ ID 564 
SequenceDescription : 



25 



Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atggcaagta tttcatcgct gggagtcggg tcaggtctgg atttaagttc aatccttgat 60 

agcctcaccg ccgcgcaaaa agcgacgcta acccctattt caaatcagca atcgtcgttt 120 

30 accgctaaac ttagcgccta cggtacgctg aaaagcgcgc tgacgacttt ccagaccgcc 180 

aatactgcat tgtctaaagc cgatcttttt tccgccacca gcaccaccag cagcaccacc . 240 

gcgttcagtg ccaccaccgc gggtaacgcc atcgccggga aatacaccat cagcgtcacc 300 

catctggcgc aggcgcaaac cctgaccacg cgcaccacca gagatgatac gaaaacggcg 360 

atcgccacca gcgacagcaa actcaccatt caacaaggcg acgacaaaga tccgattacc 420 

35 attgatatca gcgcggctaa ctcatcgtta agcgggatcc gtgatgccat caacaacgca 480 

aaagcgggcg taagtgcgag catcattaac gtgggtaacg gtgaatatcg tctgtcagtc 540 

acatcaaatg acaccggcct tgataatgcg atgacactct cggtcagcgg tgatgatgcg 600 

ctacaaagtt ttatgggcta tgacgccagt gccagcagca acggtatgga ggtctcggtc 660 

gccgcccaga atgcgcagct gacggtcaac aacgtcgcca tcgagaacag cagcaacact 720 

40 atcagcgacg cgctggaaaa catcactctg aacctgaacg atgtcaccac gggcaaccag 78 0 

acgctaacca tcactcagga cacctccaaa gtgcaaacgg cgattaaaga ctgggtgaat 840 

gcctataact cgctaataga taccttcagc agcctgacca aatacaccgc cgtagatgcg 90 0 

ggagctgata gccagagttc tagcaatggc gcactgctcg gcgactccac gctgcggacg 960 

attcagacgc agttgaagtc gatgctgagt aataccgtca gttcttccag ctataaaacg 102 0 

45 ctggcgcaga ttggtatcac gaccgatccc agcgatggca aactggaact ggatgccgac 108 0 

aaactcaccg ctgcactgaa aaaagatgcc agcggcgtag gtgcattgat tgttggcgat 1140 

ggtaaaaaaa ccggcatcac aaccaccatc ggcagcaacc tgaccagttg gctttcgaca 120 0 

acgggcatta ttaaagccgc taccgatggc gttagtaaaa ccctgaataa gttaactaaa 1260 

gactacaacg ccgccagcga tcgcattgat gcgcaggtcg cgcgctacaa agaacaattt 1320 

50 acccaactgg acgttttaat gacctcgtta aacagcacca gcagctactt aacgcagcag 1380 

ttcgaaaaca acagtaattc caagtaa 1407 
<212> Type : DNA 
<211> Length : 1407 

SequenceName : SEQ ID 565 

55 SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

60 <400> PreSequenceString : 

gtggagggta aagctgataa tgtcgtactg gaaaatggcg gacgcctgga tgtgctgacc 60 

ggacacacag ccactaatac ccgcgtggat gatggcggaa cgctggatgt ccgcaacggt 120 

ggcaccgcca ccaccgtatc catgggaaat ggcggtgtac tgctggccga ttccggtgcc 180 

gctgtcagtg gtacccggag cgacggaaag gcattcagta tcggaggcgg tcaggcggat 240 

65 gccctgatgc tggaaaaagg cagttcattc acgctgaacg ccggtgatac ggccacggat 300 

accacggtaa atggcggact gttcaccgcc aggggcggca cactggcggg caccaccacg 360 

ctgaataacg gcgccatact taccctttcc gggaagacgg tgaacaacga taccctgacc 420 
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atccgtgaag gcgatgcact cctgcaggga 
aaatcaggaa gtggcacact cactgtcagc 
ctgaatgaag gcacgctgac gctgaacgac 
cgcggtacag ccctgaagct gaccggcagc 
5 aatgtcactc tcgcctctgg tgccacctgg 
gtggtggatg acctcagcca tgccggacag 
ttcgtaccgg caaccctgaa agtgaaaaac 
cgtgtacgcc cggatatggc acagaacaat 
gcaaccggaa aaaccatcct gaacctggtg 

10 accagcggta agggtattca ggtggtggaa 
gcctttatcc aggggaataa gctgcaggcc 
agtgatgaga gctggtatct gcgcagtgaa 
gcctccatgc tgacacaggc aatggactat 
cagaccggtg taagcggtga aaataacagc 

15 ggtcacgata acaacggcgg tattgcccgt 
ggcttcgtcc gtctggaggg tgacctgctc 
gcgggggtat atggtgctgc tggccatfcct 
cgtgccggca cggtccggga tgatgccggc 
aacgcctccg gcctgtgggc tgacattgtg 

20 tcatcggaca ataacgactt ccgcgtccgg 
ggtctgccct tcagtatcac tgacaatctg 
cagggactct ccctggatga cggccaggat 
agtgcacaac atgtgcgtgc cggcttccgt 
aaaggcacct catcccgtga caccctgcgc 

25 ccggtgaact ggtgggtaca gccttctgtt 
agcatgggta cagccgcagc cggcagtaac 
tcactggacc tgcaggccgg actggaagcc 
caggccagtt atgcccacag catcaacggc 
acactgaatg taaccttctg a 

30 <212> Type : DNA 

<211> Length : 2121 

SequenceName : SEQ ID 566 
SequenceDescription :. 



ggcgctctca ccggtaacgg cagcgtggaa 480 

aacaccacac tcacccagaa agccgtcaac 54 0 

agtaccgtca ccacggatgt cattgctcag 600 

actgtgctga acggtgccat tgaccccacg 660 

aatattcccg ataacgccac ggtgcagtcg 720 

attcatttca cctccacccg cacagggaag 780 

ctgaacggac agaatggcac catcagcctg 840 

gctgacagac tggtcattga cggtggcagg 90 0 

aacgccggca acagtgcgtc ggggctggcg 960 

gccattaacg gtgccaccac ggaggaaggg 1020 

ggtgccttta actactccct caaccgggac 10 80 

aatgcttatc gtgcagaagt ccccctgtat 114 0 

gaccggattc tggcaggctc ccgcagccat 120 0 

gtccgtctca gcattcaggg , cggtc.atctc ,JL2'60 

ggagccacgc cggaaagcag cggcagctat 13 20 

agaacagagg ttgccggtat gtctgtgacc 13 80 

tccgttgatg ttaaggatga tgacggctcc 1440 

agcctgggcg gatacctgaa tctgatacac 150 0 

gcccagggaa cccgccacag catgaaagcc 1560 

ggctggggct ggctgggttc gctggaaacc 162 0 

atgctggagc cgcaactgca gtacacctgg 168 0 

aacgccagtt atgtgaagtt cgggcatggc 1740 

ctgggcagcc accacgatat gaactttggt 18 00 

ggcagtgcca aacacagtgt gcgtgaactg 1860 

atccgcacct tcagctcccg gggggacatg 192 0 

atgacgtfcct caccgtcaca gaatggtacg 1980 

cgtgtccggg aaaatatcac cctgggcgtt 2040 

agcagcgctg aaggttataa cagtcaggcc 2100 

2121 



35 Sequence 

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atggcctttt ctcaagcggt tagcggatta aacgctgccg ccaccaacct cgatgttatt 60 

40 ggcaacaata tcgccaactc cgccacctac ggctttaaat caggcacggc ctcttttgcc 120 

gatatgtttg ccggttcgaa agtgggactg ggggtaaaag ttgccggtat cactcaggac 180 

tttaccgatg gcacgaccac caacaccggg cgaggtctgg acgttgctat cagccagaac 240 

ggttttttcc gtctggtaga cagcaacggt tcggtgttct acagccgtaa cggacaattt 3 00 

aagctggatg aaaaccgtaa cctgttgaat acgcaaggtt tacagctgac gggttacccg 3 60 

45 gtaaccggta cgccgccgac tattcagcaa ggggcgaatc cgaccaatat ttcgatcccg 42 0 

aataccctga tggcagcgaa aactaccacc acggcgtcga tgcagatcaa cctgaattcc 480 

agtgatccgc ttcctactgt tacgccattc agcgccagca atgcggatag ctataacaaa 540 

aaaggttcgg tgactgtttt cgacagtcag ggtaatgctc atgacatgag cgtctatttt 600 

gtgaagaccg gggataataa ctggcaggtc tacacccagg atagcagtga tccaaacagc 660 

50 attgcgaaga cagcgacaac actggaattt aatgctaatg gcacattagt ggatggtgcg 720 

atggcgaata atatcgcaac cggcgcaatt aacggtgcag aacccgccac gtttagtctg 780 

agcttcctca actccatgca gcaaaatacc ggcgctaaca acattgtggc aaccacccag 840 

aatggctaca aaccgggcga tctggtgagt tatcaaatca atgatgacgg tacggttgtc 900 

ggcaacaatt ccaacgaaca aacccaactg ctggggcaga ttgtactggc gaactttgcc 960 

55 aacaacgaag gtctggcatc cgaaggcgac aacgtctggt ctgcgacgca atcttctggc 1020 

gtggcgctgt tggggacagc cgggacggga aactttggca ctctgaccaa cggtgcgctg 1080 

gaagcgtcca acgtcgatct cagtaaagaa ctggtcaata tgatcgttgc ccagcgtaac 1140 

tataagtcta acgcccagac catcaaaacc caggaccaga tcctcaacac gcgggttaac 12 00 
ttacgctaa 

60 <212> Type : DNA 

<211> Length : 1209 

SequenceName : SEQ ID 567 
SequenceDe script ion : 



1209 



65 Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
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<400> PreSequenceString : 
atgaaactcg tgcacatggc cagtggttta 
gataaaagcg cggatattca gacgccagcc 
caacaacctg ctatccagca accgaatgtc 
gcactgccgc ctgatgctgt gctgaccgtg 
ccgtcaaaag tgctggcgca gaaagcggtg 
tttgttctgc catttaaccc ggcagatgtt 
gcgattaccg tgaatgacaa actggtattt 
cagggcggaa ctaaagccga cctgacattg 
caggccagcg gtggcgcaac gactaccgta 
tcttcggcag ttcccgctcc tacgcaatat 
<212> Type : DNA 
<211> Length : 573 

. _ SequenceName : SEQ ID 568 
SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 245 7T 
<400> PreSequenceString : 

atgatcataa aaaaaagcgg tggtcgctgg cagctaagcc tgctggcgag cgtggtaatc 60 

agtgcctttt ttctcaacac agcttacgcc tggcaacaag aatatatcgt tgatacgcaa 120 

cccggacatt ccacagagcg ttacacctgg gatagtgatc atcaacctga ttacaacgat 180 

attttgtcgc aacgtattca aagtagccaa agggcgctgg gactggaagt caatctggcg 240 

gaagaaactc ctgtggatgt gaccagcagt atgagtatgg gctggaattt tcctttatat 3 00 

gaacaggtta caaccggccc ggtcgcggca ttacattacg atggcacaac cacctcgatg 3 60 

tataacgagt ttggcgacag tactaccacg ctgaccgatc cgttatggca tgccagcgtg 420 

agtagcttag gctggcgtgt tgactcccgg cttggcgatc tccgaccctg ggcgcaaatc 480 

agctataacc agcaatttgg cgagaatatc tggaaggcgc aatcaggcct gagccggatg 540 

acggcgacaa accagaacgg caactggctg gatgtcaccg taggcgctga tatgttgctc 600 

aatcaaaata ttgccgccta tgctgcgcta actcaggcag aaaataccac taataatagc 660 

gactatctgt atacgatggg ggttagcgcc agattttaa 699 
<212> Type : DNA 
<211> Length r 699 

SequenceNarae : SEQ ID 569 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaatggt gcaaacgtgg gtatgtattg gcggcaatgt tggcgctcgc aagtgcgacg 60 

atacaggcag ccgatgtcac catcacggtg aacggtaagg tcgtcgccaa accgtgtacg 12 0 

gtttccacca ccaatgccac ggtagatctc ggcgatcttt attctttcag tcttatgtct 18 0 

gccggggcgg catcggcctg gcatgatgtt gcgcttgagt tgactaattg tccggtggga 240 

acgtcgaggg tcactgccag cttcagcggg gcagccgaca gcaccggata ttataaaaac 3 00 

caggggaccg cgcaaaacat ccagttagag ctacaggatg acagtggcaa cacattgaat 3 60 

actggcgcaa ccaaaacagt tcaggtggat gattcctcac aatcagcgca cttcccgtta 42 0 

caggtcagag cattgacggt aaatggcgga gccactcagg gaaccattca ggcagtgatt 480 

agcatcacct atacctacag ctga 504 
<212> Type : DNA 
<211> Length : 504 

SequenceName : SEQ ID 570 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString ; 

atgaaaagag cgcctcttat aacaggactt ttgttgatat ccacatcctg cgcttatgcc 60 

tcctcaggag ggtgtggagc cgacagcact agcggtgcga caaattacag cagtgtggtt 120 

gatgatgtta cggtgaacca gacagataac gtgacaggac gggagtttac ctctgcaacg 18 0 

ctaagtagca ctaactggca atacgcctgt tcctgctctg cgggtaaggc agttaaactt 240 

gtctatatgg tcagccccgt acttaccacc actggacatc agacaggata ttacaaactc 3 00 

aatgacagcc tggatattaa aaccacatta caggcaaacg acattccagg actcacaacc 360 

gaccaggttg tctctgttaa cacccgattc acacagataa aaagcagcac cgtatattct 420 

gctgcaaccc aaacgggtgt ttgccagggt gacacgtctc gttatggacc cgttaatatt 480 



gcggttgcga ttgcgttggc 
ccggctgcaa atacgtctat 
tccggtactg tctggatccg 
acactttctg acgcgtcgtt 
cgtactgaag gtaaacagtc 
cagccgaacg cgcgtattct 
atcaccgata ccgttcagcc 
gtgccggtac agcaaaccgc 
ccttcgactt caccaactca 
taa 



ggcttgcgca 60 

ttcagcaaca 120 

tcagaaagtc 180 

agccgatgca 240 

accattcagc 3 00 

gttgagtgcg 3 60 

ggtgatcaac 420 

cgtgccggtt 480 

ggtgaatccg 540 
573 
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ggtgcaaata ccacctttac cctgtatgtc accaagccat ttctcggctc gatgaccatt 540 

ccgaaaacgg atattgccgt cattaaaggc gcgtgggtcg atggaatggg aagcccgtct 600 

acaggtgact tccatgattt agtcaagtta tcgattcagg gaaatctcac cgccccacag 660 

tcgtgcaaaa ttaatcaggg cgatgttatt aaagttaatt ttggattcat caatggtcag 72 0 

5 aagtttacca cccgcaatgc catgccagac ggttttactc cagtagactt tgatatcact 780 

tatgactgtg gtgatacttc aaagattaaa aactcgttgc aaatgcgcat cgacggtaca 840 

actggggtag tagaccagta caacctggtc gccaggcgaa gaagttcaga caatgtgccc 900 

gatgtcggta ttcgtattga aaatctcggc ggcggagttg caaatattcc ttttcagaac 960 

ggtatccttc ccgttgatcc ttccgggcat ggcaccgtca atatgcgcgc ctggccagtt 102 0 

10 aatctggtcg gtggtgagct ggaaacagga aaatttcagg gcacagccac cattaccgtc 1080 
atggtgcggt aa 1092 
<212> Type : DNA 
<211> Length : 1092 

• SequenceName : • SEQ ID 571 

15 SequenceDe script ion .- 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

20 <400> PreSequenceString : 

atgcaaaaaa acgctgcgca tacttatgcc atttccagct tgttggtgct ttcactaacc 60 

ggctgcgcct ggataccctc cacgccgctg gtgcaggggg cgaccagtgc acaaccggtt 120 

cccggtccga cgcccgtcgc caacggttct attttccagt ctgctcagcc gattaactat 18 0 

ggctatcaac cgctgtttga agatcgtcga ccacgcaata ttggcgatac gctgaccatc 240 

25 gtgttgcagg agaacgtcag cgccagcaaa agctcctctg cgaatgccag ccgtgacggt 3 00 

aaaactaatt ttggctttga tactgtgccg cgctatttgc aggggctgtt tggtaacgct 3 60 

cgtgccgatg tcgaagcctc cggtggtaac acgttcaacg gaaagggcgg ggccaatgcc 42 0 

agcaatacct ttagcggcac gttgacggtg acggttgacc aggtactggt caacggcaac 4 80 

ctgcatgtgg tgggtgaaaa acagattgcc attaatcagg gtaccgaatt tattcgcttc 540 

30 tcgggcgtgg ttaatccacg cactatcagc ggcagcaata ccgtaccgtc tactcaggtg 600 

gcggatgcgc gtattgaata cgtaggcaat ggctacatta acgaagcgca aaatatgggc 660 

tggttgcagc gtttcttcct taacctgtcg ccaatgtaa 699 
<212> Type : DNA 
<211> Length : 699 

35 SequenceName r SEQ XD 572 

SequenceDescription : 

Sequence 



40 <213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaacgac atctgaatac ctgctacagg ctggtatgga atcacattac gggcgctttc 60 

gtggttgcct ccgaactggc ccgcgcacag ggtaaacgtg gcggtgtggc ggttgcactg 12 0 

tctcttgccg cggtcacgtc actcccggtg ctggctgctg acatcgttgt gcacccgggt 180 

45 gaaacagtga atggcggaac actggtaaac catgacaacc agtttgtatc cggaacagct 240 

gatggcgtga ctgtcagtac cgggcttgag ctggggccgg acagtgacga aaacaccggc 3 00 

gggcaatgga taaaagcggg tggcacaggc agaaacacca ctgtcaccgc aaatggtcgt 3 60 

cagattgtgc aggcaggagg aactgccagt gatacggtta ttcgtgatgg cggagggcag 420 

agccttaacg gactggcggt gaacaccacg ctggataaca gaggtgagca gtgggtacac 4 80 

50 gggggaggga aagctgccgg tacaattatt aaccaggatg gttaccagac cataaaacat 540 

ggcggactgg caaccggaac catcgtcaac accggtgcag aaggtggtcc ggagtctgaa 600 

aatgtgtcca gcggtcagat ggtcggaggg acggctgaat ccaccaccat caataaaaat 660 

ggccggcagg ttatctggtc ttcggggatg gcacgggaca ccctcattta cgccggtggt 720 

gaccagacgg tacacggaga ggcacataac acccgactgg aggggggtaa ccagtatgta 78 0 

55 cacaacggtg gcacggcaac agagacgctg ataaaccgtg atggctggca ggtgattaag 840 

gaaggaggaa ctgccgcgca taccaccatc aaccagaaag aaagctgcag gtga 894 

<212> Type : DNA 
<211> Length : 894 
60 SequenceName : SEQ ID 573 

SequenceDescription : 

Sequence 



65 <213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

atgatgatga aaactattaa acatcttctg tgctgtgcca ttgccgccag cgcattaatt 60 
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tccaccgggg tgcatgctgc gtcctggaaa gatgcgctct 
ggcaaccaaa acagcacgac acaggaaggc ggttggtcgc 
cttagcagcg gaaaccaagc cttaagcgca gataacatga 
caatactgcg cgaagcaaaa gctggcttcg gtaaccgatg 
5 gtgctggaaa agctgggcct gaacagtgaa gagcaaaaag 
ggtattcagg gtttgctgaa aacaaaagat ggtcagcaac 
acgactccgc tggcagaaaa ggtgaaaacc aaagcctgcg 
ctgaacttca tttcctga 
<212> Type : DNA 
10 <211> Length : 498 

SequenceName : SEQ ID 574 

SequenceDescription : 

Sequence - 
15 

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgtttaaag gacaaaaaac attggccgca ctggccgtat ctctgctgtt cacagcacct 60 
gtttatgcgg ctgatgaagg atccggtgaa attcacttta aaggtgaagt tattgaagca 12 0 

20 ccgtgtgaaa tacatcagga tgatattgat aaagaggttg aactcggtca ggtgaccacc 18 0 

agccacatta atcagtcaca tcacagcgat gccgttgctg tcgacctgct cttagtcaac 240 
tgtgatctgg aaaactccag caacggttcc ggtggcaaga tttctaaagt tgcagtcacc 3 00 

tttgatagct cagcgaaaac caccggcgca gatccgattc tcaacaacac cagcacgggt 3 60 

gaagccaccg gcgtcggcgt acgtttaatg aataaagatc aaagcaatat cgttttaggc 420 

25 accgctactc cagatatcga cctggctccg acctccagcg aacaaacgct gaatttcttt 480 
gcctggatgg aacaaattga tcaggctaca cctgtaacgc caggcgcagt tacggcgaat 540 
gcaacctacg tgctggatta taaataa 567 
<212> Type : DNA 
<211> Length : 567 

30 SequenceName : SEQ ID 575 

SequenceDescription : 

Sequence 



35 <213> OrganismName : Shigella flexneri 2a str. 2457T 

<400> PreSequenceString : 

atgagcgcgg gaagcccaaa attcaccgtt 

ctatggctgg caggctgttc tgacacttca 

ggcaatgcgc ctgcaaatac taattctggt 

40 acgacgtcta cagcgcagca accgcaaatt 

actcaacaac cgcaaatcca gccagtgcag 

aacggacgca tcgtctataa ccgtcagtat 

agtacctata ccgtgaaaaa aggcgacaca 

gatttccgtg accttgctca gcgcaacaat 

45 cagaccttgc aggtgggtaa tgcttccggt 

caggccgacg cagcagagca aggagttgtg 

gttgcgtcgc aaccgacaat tacgtattct 

atgttgccga acaacaagcc aactgcgacc 

gcaagcacaa ccgagccgac tgtcagcagt 

50 cgctggccga ctgagggcaa agtgatcgaa 

gggattgata tcgcaggcag caaaggacag 

gtttatgctg gtaacgcgct gcgcggctac 

gattacctga gtgcctacgc ccataacgac 

aaggcggggc aaaaaatagc aaccatgggt 

55 tttgaaattc gttacaaggg gaaatccgta 

<212> Type : DNA 
<211> Length : 1140 

SequenceName : SEQ ID 576 
60 SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
65 <4 00> PreSequenceString : 

gtgattaaat ttctctctgc attaattctt ctactggtta cgacggcggc tcaggctgag 60 
cgtattcgcg atctcaccag tgttcagggg gtaaggcaaa actcactgat tggctatggc 120 



ccagcgccgc cagcgaactt 12 0 

tcgcgtcatt aactaacttg 18 0 

acaatgccgc aggcattctg 240 

ccgaaaacat caagaaccag 30 0 

aagacaccaa ctatctggac 360 

tcaatctgga taacatcgga 420 

atctggtgtt aaagcagggg 480 

498 



cgccgcattg cggctttgtc 
aatccaccgg caccggtcag 
atgttgatta cgccgccgcc 
cagccggtac agcagccaca 
ccagtagctc agcagccggt 
gggaacattc cgaaaggcag 
cttttctata tcgcctggat 
attcaggcac catacgcgct 
acgccaatca ctggcggaaa 
atcaagcctg cacaaaattc 
gagtcttcgg gtgaacagag 
acggtcacag cgcctgtaac 
acatcaacca gtacgcctat 
acctttggcg cttctgaggg 
gcaattatcg cgaccgcaga 
ggtaatctga ttatcatcaa 
acaatgctgg tccgggaaca 
agcaccggaa ccagttcaac 
aacccgctgc gttatttgcc 



actggtttcg 60 

ctccgttaat 120 

gaaaatgggg 180 

aattcaggct 240 

acagatggaa 3 00 

ttatagcggc 3 60 

tactggcaac 420 

gaacgttggt 480 

tgccattacc 540 

caccgttgct 600 

tgctaacaaa 660 

ggtaccaaca 72 0 

ctccacctgg 780 

gggcaacaag 840 

tggccgcgtt 900 

acataatgat 960 

acaagaagtg 1020 

acgcttgcat 10 80 

gcagcgataa 114 0 
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ttggtagtgg ggctggatgg cactggtgac cagacaaccc 

acgcttaata acatgctctc acagctggga attaccgttc 

ctaaaaaacg tcgctgcggt aatggtgaca gcgtcacttc 

caaaccatcg atgtggtggt ttcttccatg ggtaatgcca 
5 ttgttgatga caccgcttaa gggcgttgac agtcaggtgt 

attctggttg gcggcgcagg agcctccgca ggcggtagca 

aacggtggac ggatcaccaa tggtgcggtt attgaacgtg 

gtcgggaata cccttaattt gcaacttaac gacgaagatt 

gctgacacca tcaaccgcgt gcgtggatat ggcagcgcca 
10 attcaggtgc gcgtaccaag tggcaacagt tcccaggtcc 

aatatgcagg ttaatgtcac cccgcaggac gctaaagtag 

tcggtggtga tgaatcgcga agtgaccctc gacagctgcg 

tcagtaacag ttaatcgtca ggccaatgtc agccagccag 

cagactgttg ttactccaca aacgcagatc gatttacgcc 
15 agcgtacgtt ccagcgccag cctcaataac gtggtgcgtg 

acgccgatgg atctgatgtc tattttgcaa tcaatgcaaa 

aaactggaaa tcatctga 1098 
<212> Type : DNA 
<211> Length : 1098 
20 SequenceName : SEQ ID 577 

SequenceDescription : 

Sequence 



25 <213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaacgtt caattattgc tgccgctgtc ttttcttctt tttttatgag cgctggagta 60 
tttgctgcag acgttgatac cggaacatta accattaaag gaaatattgc agaatctccg 120 
tgtaaattcg aagcgggtgg tgattcagta agtattaata tgccgactgt accaaccact 180 

30 gtctttgaag gtaaagctaa atattctacc tatgatgatg cagtcggtgt aaccagcagc 240 
atgttaaaaa ttagctgccc gaaagaagtt gctggtgtaa aactctcgtt gatcacaaac 300 
gacaaaataa ccggtaacga taaggcgata gccagtagca acgataccgt aggtgataac 3 60 

agcgatgtcc tagatgtttc tgcacctttt aacattgaga gttataaaac agcggaaggt 420 
caatatgcta ttccgtttaa agcaaaatac ctgaaactga cagataactc agtgcaatca 48 0 

35 ggtgatgtgt tatcttctct ggttatgcgt gtggcgcagg attaa 525 
<212> Type : DNA 
<211> Length : 525 

SequenceName : SEQ ID 578 
SequenceDescription : 

40 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

45 atggcggttc aaaagaatgt tatcaaaggc atactggcag gtacgtttgc gctaatgctg 60 

agcggttgtg tcactgtgcc ggacgccatt aaaggcagca gtaccacgcc gcaacaagat 12 0 

ttagttcggg tgatgagtgc gccgcagctg tacgttggtc aggaggcacg ctttggtggc 180 

aaagtggttg cggtacaaaa ccagcaaggg aaaacccgcc tggaaattgc taccgtaccg 240 

ctggacagcg gagctaggcc gacgctggga gaaccttctc gcggtcgcat ttatgccgat 3 00 

50 gtgaacggtt ttctggaccc ggtggatttc cgtggacaac tggttacggt agtcgggcca 360 

atcaccggtg cggttgacgg caaaatcggc aatacgccct ataaatttat ggtgatgcaa 42 0 

gtaacgggtt acaaacgttg gcatttaacc cagcaggtga ttatgccgcc tcagccgatt 480 

gatccatggt tttatggcgg tcgtggctgg ccctatggct acggcggatg gggctggtat 540 

aatcccggcc ccgcgagagt acaaacagtt gtaaccgaat aa ' 582 

55 <212> Type : DNA 

<211> Length : 582 

SequenceName : SEQ ID 579 
SequenceDescription : 

60 Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

atgagaaaca aaccttttta tcttctgtgc gcttttttgt ggctggcggt aagtcgcgtt 60 

65 ttggctgcgg atagcacgat tactatccgc ggctatgtca gagataacgg ctgtagtgtg 120 

gccgctgaat caaccaattt tactgttgat ctgatggaaa acgcggcgaa gcaatttaac 18 0 

aacattggcg cgacgactcc tgtcgttcca tttcgtattt tgctgtcacc ctgtggtaat 240 



agacgccgtt taccacacaa 180 

cgacgggcac caatatgcag 24 0 

ctccgtttgg acgtcagggg 3 00 

aaagcttgcg tggaggtacg 360 

atgcgctggc gcagggcaat 42 0 

gtgttcaggt taaccaactg 480 

aattgcccag ccagtttggc 540 

tcagcatggc gcagcaaatc 600 

ccgcgttaga tgcgcggact 660 

gtttccttgc cgatatccag 72 0 

tgattaactc gcgcaccggt 780 

cggtagcgca gggaaatctc 840 

atacaccgtt tggtggtgga 900 

agagcggcgg ttcgctgcaa- 9 60 

cgctcaatgc gctgggcgct 102 0 

gtgcgggatg tctgcgggca 108 0 
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gccgtttctg ccgtaaaagt tgggtttacc ggcgttgcag atagccacaa. tgccaacctg 300 

cttgcacttg aaaatacggt gtcagcggct gcgggactgg gaatacagct tctgaatgag 3 60 

cagcaaaatc aaatacccct taatgctcca tcgtctgcga tttcgtggac gaccctgacg 420 

ccgggtaaac caaatacgct gaatttttac gcccggctaa tggcgacaca ggtgcctgtc 48 0 

actgcggggc atatcaatgc tacggctacc ttcactcttg aatatcagta a 531 

<212> Type : DNA 
<211> Length : 531 

SequenceName : SEQ ID 58 0 

SequenceDescription : 

Sequence 



<213> OrganismName :,. Shigella flexneri 2a str, 2457T 
<4 00> PreSequenceString : 

atgaaaaagt taacagtggc ggctttggca gtaacaactc ttctttctgg cagtgccttt 60 

gcgcatgaag caggcgaatt ttttatgcgt gcaggttctg caaccgtacg tccaacagaa 12 0 

ggtgctggtg gtacgttagg aagtctgggt ggattcagcg tgaccaataa cactcaactg 18 0 

ggcttgacgt ttacttatat ggcgaccgac aacattggtg tggaattact ggcagcgacg 240 

ccgttccgcc ataaaatcgg cacccgggcg accggcgata ttgcaaccgt tcaccatctg 3 00 

ccgccaacac tgatggcgca gtggtatttt ggtgatgcca gcagcaaatt ccgtccttac 360 

gttggggccg gtattaacta caccaccttc tttgataatg gatttaacga tcatggcaaa 42 0 

gaggcggggc tttccgatct cagtctgaaa gattcctggg gage tgc egg gcaggtgggg 480 

gttgattatc tgattaaccg tgactggttg gttaacatgt cagtgtggta catggatatc 54 0 

gataccaccg ccaattataa getaggeggt gcacagcaac aegatagegt acgcctcgat 60 0 

ccgtgggtgt ttatgttctc agcaggatat cgtttttaa 63 9 
<212> Type : DNA 
<211> Length : 639 

SequenceName : SEQ ID 581 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 
ttgtttttta agegaggaaa gattttgagt 
atcgtgatgt ttttatcggt tggactgeta 
gatacaggaa egtattcegg ctccgtttac 
atttcgegea ccacgggaac cagegtaaaa 
ccttacacca ttgaagttgg tcagaaacta 
acaegtaaat caaccgccaa atcaacgact 
gtaccgaaat cttcctggcc gecagtaggg 
aaagttatca tgeegtatte gacagcagat 
ecaeggggta cacctattta cgccgcgggt 
ctgcgtggct aeggtaatet catcatgatt 
geccataatg acacgatget ggtaaataat 
gecactatgg ggagcacgga tgeggcatet 
geaaeggcaa ttgatccget acgttacttg 

<212> Type : DNA 
<211> Length : 780 

SequenceName : SEQ ID 582 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atggcacaag tcattaatac caacagcctc tegctgatea ctcaaaataa tatcaacaag 60 

aaccagtctg cgctgtcgag ttctatcgag cgtctgtctt ctggcttgcg tattaacagc 12 0 

gcgaaggatg acgccgcggg teaggegatt getaacegtt ttacttctaa cattaaaggc 180 

ctgactcagg ctgcacgtaa cgccaacgac ggtatttctg ttgeacagae cactgaaggc 240 

gcgctgtccg aaatcaacaa caacttacag cgtatccgtg agetgaeggt tcaggcttct 3 00 

acegggacta actctgattc ggatctggac tccattcagg acgaaatcaa atcccgtctc 3 60 

gacgaaattg accgcgtatc cggtcagacc cagttcaacg gcgtgaacgt actggcaaaa 420 

gaeggttega tgaaaattca ggttggtgcg aatgaeggee agactatcac tattgatctg 480 

aagaaaattg actctgatac gctggggctg aatgggttta atgtgaatgg tggegggget 540 



gcgggacgcc tgaataaaaa 
ttggcgggct gttegggtag 
acegtgaaac ggggggatac 
gagctggege gaetgaaegg 
aaactgggtg gggcgaaaag 
aaaacegcat cggttacacc 
caacgttgtt ggttatggcc 
ggeggcaata aagggattga 
gcaggaaagg tggtgtatgt 
aaacacagtg aagattacat 
gggcaaagcg tgaaggctgg 
gttcgcctgc atttccagat 
ccgccgcagg gcagcaagcc 



atctctgggt 60 

caaatcatcc 120 

getatategt 180 

catttccccc 240 

tagcagtagt 3 00 

gtcatcagcg 3 60 

aacgacaggg 42 0 

tatctcagct 480 

gggcaaccag 540 

taeggcttae 600 

gcaaaaaatc 660 

tcgttaccgt 720 

aaaatgctga 780 
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gtggctaata ctgctgcatc taaagctgac ttggtagctg ctaatgcaac agtggtcggc 600 

aacaaatata ctgtgagtgc gggttacgat gctgctaaag cgtctgattt gctggctgga 660 

gttagtgatg gtgatactgt tcaggcaacc attaataacg gcttcggaac ggcggctagt 72 0 

gcaacgaatt acaagtatga cagtgcaagt aagtcatact cttttgatac cacaacggct 78 0 

tcagctgccg atgttcagaa atatttgacc ccgggagttg gtgataccgc taagggcact 840 

attactatcg atggttctgc acaggatgtt cagatcagca gtgatggtaa aattacggca 90 0 

agcaatggag ataaactcta cattgataca actgggcgct taacgaaaaa cggctctggt 960 

gcttctttga ctgaggctag tctgtccaca cttgcagcca ataataccaa agcgacaacc 102 0 

attgacattg gcggtacctc tatctccttt accggtaata gtactacgcc ggacactatt 1080 

acttattcag taacaggtgc aaaagttgat caggcagctt tcgataaagc tgtatcaacc 1140 

tcgggaaaca atgttgattt cactactgca ggttatagcg tcaacggcac aactggcgct 120 0 

gtaacaaaag gtgttgattc ggtttatgtt gataacaacg aggcgttgac cacatctgat 12 60 

actgtagatt tttacctaca ggatgatggt tcagtgacta acggcagcgg taaggcagtt 132 0 

tataaagatg ctgatggtaa attgacgaca gatgctgaaa caaaagcagc tacgacagcc 138 0 

gatcccctga aagctctgga tgaagccatc agctccatcg acaaattccg ctcctccctg 1440 

ggtgcggtgc aaaaccgttt ggattccgca gtcaccaacc tgaacaacac cactaccaac 1500 

ctgtctgaag cgcagtcccg tattcaggac gccgactatg cgaccgaagt gtccaacatg 1560 

tcgaaagcgc agattatcca gcaggccggt aactccgtgc tggcaaaagc caaccaggta 1620 

ccgcagcagg ttctgtctct gctgcagggt taa 1653 
<212> Type : DNA 
<211> Length : 1653 

SequenceName : SEQ ID 583 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgaaaaaaa ttgcatgtct ttcagcactg gccgcagttc tggctttcac cgcaggtact 60 
tccgtagctg cgacttctac cgtaactggc ggttacgcac agagcgacgc tcagggccaa 120 
atgaacaaaa tgggcggttt caacctgaaa taccgctatg aagaagacaa cagcccgctg 180 
ggtgtgatcg gttctttcac ttacaccgag aaaagccgta ctgcaagctc tggtgactac 240 
aacaaaaacc agtactacgg catcactgct ggtccggctt accgcattaa cgactgggca 3 00 

agcatctacg gtgtagtggg tgtgggttat ggtaaattcc agaccactga atacccgacc 360 
tacaaacacg acaccagcga ctacggtttc tcctacggtg ctggtctgca gttcaacccg 420 
atggaaaacg ttgctctgga cttctcttac gagcagagcc gtattcgtag cgttgacgta 480 
ggcacctgga ttgccggtgt tggttaccgc ttctaa 516 
<212> Type : DNA 
<211> Length : 516 

SequenceName : SEQ ID 584 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

atgaagcgta atattatagg cggtgcattc actctggcat ctctaatgct ggccgggcat 60 

gcactggcag aagatggtgt tgttaacttc gtcggtgaaa ttgtcgacac tacttgtgaa 12 0 

gttacctccg atacagccga tcaaattgtc ccactgggta aagtcagtaa aaatgcattt 18 0 

tcaggtgtag gtagtctggc gtcgccacag aagttcagta ttaaacttga aaattgcccg 240 

gcaacgtaca ctcaagcagc cgttcgtttt gatggtacag aagcgcctgg cggcgacggc 3 00 

gacctgaaag tgggtacgcc gcttacagca ggcaaccctg gtgattttac cggtacagga 360 

caagcgattg cggcaaccgg cgttggtatt cgtattttta accagtccga taattcgcag 42 0 

gttaaacttt ataacgactc tgcttatacc gctatcgatg ctgaaggcaa ggctgaaatg 480 

aagtttattg cacgctatgt ggcaaccaat gcgaccgtaa cggctggtac ggcgaacgct 540 

gattcacaat ttactgtcga atataagaaa taa 573 
<212> Type : DNA 
<211> Length : 573 

SequenceName : SEQ ID 585 

SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

atgaaaaaga gcactctggc attagtggtg atgggcattg tggcatctgc atctgtacag 60 
gctgcagaaa tatataataa agacggtaat aaactggatg tctatggcaa agttaaagcc 12 0 
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atgcattata tgagtgataa cgccagtaaa gatggcgacc agagttatat ccgttttggt 180 

tttaaaggtg aaactcaaat taacgatcaa ctgactggtt atggtcggtg ggaagcggag 240 

tttgccggaa ataaagcgga gagtgatact gcacagcaaa aaacgcgtcfc cgcttttgcc 3 00 

gggttgaaat ataaagattt gggttctttc gattatggtc gtaacctggg cgcgttgtat 360 

5 gacgtggaag cctggaccga tatgttcccg gaatttggtg gcgattcctc ggcgcagacc 42 0 

gacaacttta tgaccaaacg cgccagcggt ctggcgacgt atcggaacac cgacttcttc 480 

ggcgttatcg atggcctgaa cttaaccctg caatatcaag ggaaaaacga aaaccgcgac 540 

gttaaaaagc aaaacggcga tggcttcggc acgtcattga catatgactt tggcggcagc 600 

gatttcgcca ttagtggggc ctataccaac tcagatcgca ccaacgagca gaacctgcaa 660 

10 agccgtggca caggcaagcg tgcagaagca tgggcaacag gtctgaaata cgatgccaat 720 

aatatttatc tggcaacttt ctattctgaa acacgcaaaa tgacgccaat aactggcggc 780 

tttgccaata agacacagaa ctttgaagcg gtcgctcaat accagtttga ctttggtctg 840 

cgtccatcgc tgggttatgt cttatcgaaa gggaaagata ttgagggcat cggtgatgaa 900 

gatctggtca attatatcga-cgtcggtgct acgtattatt tcaacaaaaa tatgtcagcg 960 

15 tttgttgatt ataaaatcaa ccaactggat agcgataata agctgaatat taataatgat 102 0 

gatactgtcg cggttgggat gacgtatcag ttttaa 1056 

<212> Type : DNA 
<211> Length : 1056 

SequenceName : SEQ ID 586 

20 SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

25 <4 00> PreSequenceString : 

atgcgtaagc agtggctcgg gatctgcatc gcggcaggaa tgctcgcggc atgtacaagc 60 

gatgatggtc agcaacagac ggtaagtgta ccgcagcctg cggtatgtaa cggccccata 120 

gttgaaatta gcggggcgga cccgcgtttc gaaccactga acgcgacggc aaatcaggat 180 

taccagcgcg acggtaaaag ctacaaaatc gtgcaggatc cgtctcgatt tagccaggcg 240 

30 ggactggcgg caatctatga tgccgaacca ggcagtaacc tgacggcctc tggcgaagct 300 

ttcgatccga caaagctgac ggcggcccat ccaacgcttc cgatccccag ctacgccaga 360 

atcactaacc tggctaacgg gcgaatgatc gtggtgcgca ttaatgatcg cggcccttac 420 

ggcaacgacc gcgttatttc gctttctcgc gcggcagctg accgtcttaa cacgtcaaac ~ 48 0 

aacaccaaag ttcgtatcga tccgattatt gtcgcccagg atggttcgct ttctggtcct 54 0 

35 ggtatggctt gtaccacagt cgccaaacag acttacgccc tgcctgcacc tcccgattta 600 

agcggtggcg cgggaacaag ttcagtgtct ggcccgcagg gtgacattct tccggtcagt 660 

aattcgacgc taaaaagcga agatccgacc ggcgcgccgg taaccagtag cggtttcctc 72 0 

ggcgctccaa cgaccttagc gcctggtgta ctggaaggca gcgaaccgac gcctgcgcca 780 

cagcccgttg ttacagcttc gtcgacaacg cctgcaacct cgcctgcaat ggtgacaccg 840 

40 caagccgcct cgcaaagcgc cagcggcaac tttatggtgc aagtcggggc cgtaagcgat 900 

caggctcgtg cgcaacagta ccaacagcaa ctgggacaga agttcggcgt ccccggtcgc 960 

gtaactcaaa atggcgcggt ctggcggatc cagcttggcc cattcgccag caaagccgaa 1020 

gccagtacct tgcagcaacg tttgcaaacc gaagcccaat tacagtcatt tattactacc 108 0 

gcgcagtag 1089 

45 <212> Type : DNA 

<211> Length : 1089 

SequenceName : SEQ ID 587 
SequenceDescription : 

50 Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

atgaagaaaa aaacgatata tcagtgcgtt attttgttct ttagccttct taacatccat 60 

55 gtcgggatgg ctgggcctga acaagtcagt atgcatattt atgggaatgt ggtcgatcag 120 

ggctgtgatg tcgccaccaa aagtgcatta caaaatattc atattggtga ttttaatatc 180 

agtgattttc aggccgcgaa taccgtaagc actgctgctg atttgaatat tgatatcacc 240 

ggttgtgccg ctggtattac tggcgcggac gtccttttta gcggcgaggc tgacaccctt 3 00 

gcgccgacac tgctcaaact aactgacaca ggcggaagcg gtggtatggc aacggggatt 360 

60 gccgtgcaaa ttcttgatgc gcaaagtcag caagaaatcc cgctcaatca ggtccagcct 420 

cttacgccct taaaagccgg ggataacaca ctcaaatatc aacttcgtta taagtccaca 480 

aaggcgggag caacgggcgg taatgcgacg gcggttctct attttgatct ggtttaccag 540 

tga 543 
<212> Type : DNA 
65 <211> Length : 543 

SequenceName : SEQ ID 588 
SequenceDescription : 
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Sequence 

<213> OrganismName : Shigella flexneri 2a str. 2457T 
5 <400> PreSequenceString : 

atgaaaaaca aattgttatt tatgatgtta acaatactgg gtgcgcctgg gattgcagcc 60 
gcagcaggtt atgatttagc taattcagaa tataacttcg cggtaaatga attgagtaag 120 
tcttcattta atcaggcagc cataattggt caggctggga ctaataatag tgctcagtta 180 
cggcagggag gctcaaaact tttggcggtt gttgcgcaag aaggtagtag caatcgggca 240 
10 aagattgacc agacaggaga ttataacctt gcatatattg atcaggcggg cagtgccaac 3 00 

gatgccagta tttcgcaagg tgcttatggt aatactgcga tgattatcca gaaaggttct 360 
ggtaataaag caaatattac acagtatggt actcaaaaaa cggcagttgt agtgcagaga 420 
cagtcgcaaa tggcaattcg cgtgacacaa cgttaa 456 
<212> Type DNA 
15 <211> Length : 456 

SequenceName : SEQ ID 589 
SequenceDescription : 

Sequence 

20 

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

gtgatgaaat ttaaaaaatg tcttctgcct gtggcaatgt tagcgtcatt cactctggca 60 
ggatgccagt caaatgctga cgatcatgcc gccgatgttt atcaaaccga tcaactgaat 120 

25 accaaacaag aaactaaaac cgttaatatt atttccattc ttcccgcaaa agttgccgta 180 
gacaactccc aaaataaacg gaacgcacaa gccttcggcg cgcttattgg cgcagtcgct 240 
ggcggtgtta tcggccacaa cgtcggttct ggcagcaatt ccggaacgac ggcaggggca 3 00 

gttggcggcg gagctgtagg cgcggcagcg ggttctatgg tgaatgataa aaccttagtg 3 60 

gaaggtgttt ctttaacata taaggaaggc accaaagtgt atacctccac ccaggagggt 42 0 

30 aaagagtgcc agtttacgac aggtttagcc gttgttatta ccacaacgta taacgaaacg 480 
cgtattcagc caaataccaa atgtcctgaa aagagctaa 519 
<212> Type : DNA 
<211> Length : 519 

SequenceName : SEQ ID 590 

35 SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

40 <400> PreSequenceString : 

atgcaaacga aaaaaaatga aatttgggtg ggtatctttt tattagcagc actgctggcg 60 
gcgctgtttg tttgcctgaa ggcggcgaac gtgacgtcca tacgtactga atcgacctac 120 
acgctttatg cgacgttcga taacattggc ggcctgaaag cgcgctctcc ggtcagcatt 180 
ggtggtgttg ttgtgggccg ggtggcggat attacgctgg acccgaaaac ctatctgccg 240 

45 cgcgtaacgc tggaaattga acaacgttat aaccacattc ctgataccag ttcgctgagc 3 00 

attcgtactt ccggcctgct gggagaacaa tatctggcat taaacgtcgg ttttgaagac 360 
ccggaactgg ggactgctat cctgaaggat ggcgatacaa ttcaggacac caagtctgcg 420 
atggtgctgg aagatctcat tggtcagttc ctttacggta gtaaaggcga tgacaataag 480 
aatagtggcg atgcgccagc tgctgcgcca ggtaataatg aaaccactga acctgtgggt 540 

50 acaacgaaat aa 552 
<212> Type : DNA 
<211> Length : 552 

SequenceName : SEQ ID 591 
SequenceDescription : 

55 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

60 atggcaccgt tagccttttc tgcacaatca ttggctgaat cattaacggt ggaacaacgc 60 

cttgagttat tagaaaaagc gttaagagaa acgcaaagcg aactcaaaaa gtataaagat 120 

gaagagaaga aaaaatatac gccagcgacg gtgaatcgta gcgtaagtac gaatgatcaa 180 

gggtatgccg ccaatccgtt cccgaccagt agtgccgcaa aacctgatgc tgtactggtc 240 

aaaaatgaag agaaaaatgc cagtgagaca ggctcgattt attcttccat gactctgaaa 3 00 

65 gatttcagta agtttgtgaa agatgaaatt ggctttagtt acaacggcta ttaccgttct 3 60 

ggttggggga ccgcctctca tggttcacct aaatcatggg cgattggttc tctgggccgc 420 

tttggtaacg aatactccgg ctggtttgat ttgcagttaa aacaacgtgt ctacaacgaa 480 
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aacggcaaac gggttgatgc cgttgtgatg atagatggta acgttggtca gcagtactct 540 

accggctggt ttggcgataa tgccggtggc gagaacttta tgcagttctc cgatatgtac 600 

gttaccacca aaggtttcct gccctttgcg ccagaggctg atttctgggt gggtaaacac 660 

ggtgcgccga aaattgaaat ccagatgctg gactggaaaa cgcagcgtac tgatgctgca 72 0 

5 gcgggtgtag gtctggaaaa ctggaaagtc ggtccgggta aaattgatat cgcgctggtt 78 0 

cgcgaagatg tcgatgatta cgatcgcagc ctgcaaaaca aacagcagat taatacccat 840 

actattgatt tacgctataa agatatcccg ttatgggata aagcgacctt aatggtgagt 90 0 

ggtcgttatg tcacggcaaa cgaaagcgca tcggaaaaag ataatcagga taataacggg 960 

tattatgact ggaaagatac ctggatgttc ggcacatctt taacgcagaa atttgataaa 1020 

10 ggtggcttca acgaattctc cttcctggtt gcgaataact ctatcgccag gaactttggc 1080 

cgttatgctg gcgcaagtcc atttaccacc tttaatggtc gttattatgg tgatcacacc 1140 

ggcggaacag cagttcgtct gacctcgcag ggcgaagcct atatcggcga tcattttatt 12 00 

gtggctaacg cgattgttta ctccttcggt aacaatattt atagctacga aacaggcgca 12 60 

cactctgatt ; tcgaatctat tcgtgcggtt gttcgcccgg cctatatttg ggaccaatat 13 20 

15 aaccagacag gtgttgaact gggctatttc acccagcaaa acaaagatgc gaatagtaat 13 80 

aaatttaatg agtctggtta taaaaccacg ctcttccata cctttaaagt caataccagt 1440 

atgttgacct cgcgtctgga aattcgtttc tacgccacgt atatcaaagc cctggaaaac 1500 

gaactggatg gcttcacctt cgaagacaat aaagacgccc agtttgctgt cggtgcccag 1560 

gctgaaatct ggtggtaa 1578 

20 <212> Type : DNA 

<211> Length : 1578 

SequenceName : SEQ ID 592 
SequenceDe script ion : 

25 Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgaaaaaaa gaattttatc agcagtttta gtgagtggtg taactcttag ttctgcgaca 60 

30 acattatcag ctgtaaaagc tgatgacttt gatgcgcaga ttgcgtcaca agattctaaa 12 0 

atcaacaact tgaccgcaca acagcaagca gcacaagcac aagttaatac gattcaagga 180 

caagtaagtg ctttacagac acaacaagct gaattacaag ctgaaaatca aagacttgaa 240 

gctcagtctg ctactttggg tcaacaaatt caaacacttt caagcaaaat tgttgcacgt 3 00 

aatgaatctt tgaagcaaca agctcgtagt gctcaaaaaa gtaacgcagc taccagctat 3 60 

35 attaatgcta tcattaattc aaaatcagtt tctgatgcta ttaatcgtgt ttcggctatt 420 

cgtgaagttg tatctgctaa tgaaaaaatg cttcaacaac aagagcaaga taaagcagct 480 

gttgagcaaa agcaacaaga aaatcaagca gcaattaata ctgttgcagc taatcaggag 54 0 

acaattgctc aaaatacaaa tgctttaaat acacagcaag ctcaattaga agcagcacaa 600 

ctaaacttgc aagctgaatt gactactgca caagatcaaa aagctacttt agttgctcaa 660 

40 aaagcggcag cagaggaagc tgcacgccaa gcagcagcag cacaagcggc agcagaagct 72 0 

aaggccgcag cagaagcgaa agctttacaa gaacaagcag cgcaagcaca agcagcagca 780 

aataataata ctcaagctac agatgtttct gaccaacaag cagcggcagc tgataacact 840 

caagcagcac aaacaggtga ttcaactgag cagtcagcag cacaagcag't aaataattct 900 

gatcaagaaa gtactacagc aacagaagca caaccatcag cttctagtgc ttcgacagct 960 

45 gctgtagcag ctaatacttc ttctgctaat acatatccag cagggcaatg tacttggggt 1020 

gttaaatcat tagctccttg ggtaggaaac tactggggta atggtggaca atgggcagca 1080 

agtgcagcag cggcaggata tagagttggt tctacacctt cagctggagc tgtagctgta 1140 

tggaatgatg gcggttatgg acacgttgct tatgttacag gtgttcaagg tggccaaatt 12 00 

caagttcaag aagctaacta tgcaggtaac caatctattg gtaactaccg tggttggttt 1260 

50 aatccaggta gtgtaagcta tatctatcca aactaa 1296 
<212> Type : DNA 
<211> Length : 1296 

SeguenceNatne : SEQ ID 593 
SequenceDescription : 

55 

Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<40 0> PreSequenceString : 

60 atgaaagtca aaaaaactta cggttttcgt aaaagtaaaa ttagtaaaac actgtgtggt 60 

gctgttctag gaacagtagc agcagtctct gtagcaggac aaaaggtttt tgccgatgaa 120 

acgaccacta ctagtgatgt agatactaaa gtagttggaa cacaaactgg aaatccagcg 180 

accaatttgc cagaggctca agggagtgcg agtaaggaag ctgaacaaag tcaaaaccaa 240 

gctggagaga caaatggttc aataccagtt gaagtaccta aaactgatct tgatcaagca 3 00 

65 gcaaaagatg ctaagtctgc tggtgtcaat gttgtccaag atgccgatgt taataaagga 360 

actgttaaaa cagctgaaga agcagtccaa aaagaaactg aaattaaaga agattacaca 420 

aaacaagctg aggatattaa gaagacaaca gatcaatata aatcggatgt agctgctcat 480 
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gaggcagaag ttgctaaaat caaagctaaa 
gatatggcag ctcataaagc cgaggttgaa 
acagcttatg aagctaaatt ggctcaatat 
aatgctgcca atcaagcagc ctatcaaaaa 
5 cgtgttcagg aagctaatgc agccgccaaa 
aatgctaaaa atacagaaat tgccgctgcc 
gccaaagctg aatatgagac taagttagct 
gaagctaatg ccgcaaacga agcagactat 
cttgctcgcg ttcaaaaagc caatgcggat 

10 gcaaataatg ccaaaaatgc ggcactcaca 
gagaatgcta aggcgactta tgaagctgca 
gtgaaaaaag ctaatgccgc aaacgaagca 
acagagctcg ctcgcgttca aaaagccaat 
gtagcagcaa ataatgccgc aaatgcagcg 

15 cgcaatgcgg atgctaaagc tgattacgaa 
gccaaatatc aaaaagattt agcagactat 
caagcttcta ttaaagctgc actggcagag 
ttaacagaac catctgctca aaatttggtc 
ttgacaacag atgggaagtt ccttaaggct 

20 acttcaaaag caaaatatga ccaaaaaatt 
ttagaacaat ctaatgatgt tgcttcttct 
gctggctggt caacgacagt aagcaataac 
gagcgcggtc aaagcgcaac agctacatac 
aaaaagattt ctaaaattgt ctacaagtat 

25 caaaaggttt ggttaggtat ttttaccgat 
acaggtcaag ttgaaaaaaa cacttctatt 
gaagatggaa aaccaattaa ttttgataat 
gaaaataatt ctattgagat ggccaaagat 
tcatctatcg gtgaaaagaa tggcatgatt 

30 ggtcaaggtg gtgctcgttg gaccatgtat 
gatagttcag atgcgcctaa ctcttggtat 
aataacagtg tgactttggg tgctatctca 
atggcaattg aaactggcaa aaaaccaaat 
gcggttaatg ttcctaaagt taccaaggaa 

35 ccaactaaac caacttatga aacagaaaag 
tatgaaaagg agccaacacc gccgacaagg 
acaccgccga cctatgaaac agaaaagccg 
gaagcagagc caacaccgcc gacaaggaca 
ccgccgacct atgaaacaga aaagccgttg 

40 gcagagccaa cgccaccgac accaacacca 
ccaacttatg aggttattcc aacaccgccg 
cctccatctg taccaactgt tcatttccat 
aacaaagaaa ttagaaacaa taacgatatt 
tctgttgtta agttccagct gaagacagca 

45 tcctttgtct tggtagatcc cctgccatct 
gctgcaagcc ctggctttga tgtcacttat 
gcaactgcag caactttggc tacgtttaat 
tatccaacag tggtcggaca agttcttaat 
ctcacagtca atgatgctta tggcattaaa 

50 aaaccaaatg atccagataa tccaaataat 
aacgaaaatg gcgttgttat tgatggtaaa 
gagctaactt gggatttgga tcaatataaa 
aaaggatttt actatgtaga tgattatcca 
gtgaagatta cagatgctaa tggtaatgaa 

55 aatcttgaag cagcccctca agaaattaga 
aaaggtgctt tccaaatttt ccgtgccgat 
aaaactggaa ttgatttgaa gattgtatca 
acaggcggca gttatgaaaa tcaagcttac 
aatatcatta tcaataatgt tcctaagatt 

60 gatccggctg atacaaataa tgttgatggt 
taccgtttga ttggtggcat tatccctgca 
ttttatgatg attatgatca aacaggagat 
aaggttgata tcacttttaa agacggttct 
tatacgacag cggaagttga taccgctaaa 

65 tttctgcgtt ctgtttcaat tgattcagcc 
cgtattgcgg ttggtacttt tgaaaatact 
agttcaaata cagtgaaaac aactactcct 



aatcaggcaa ctaaagaaca gtatgaaaaa 540 
cgcattaatg ctgcaaatgc tgccagtaaa 600 
caagcagatt tagcagccgt tcaaaaaacc 660 
gcccttgctg cttatcaggc tgaactgaaa 72 0 

gccgcttatg atactgctgt agcagcaaat 780 
aatgaagaaa ttagaaaacg caatgcaacg 840 
caatatcaag ctgaactaaa gcgtgttcag 9 00 

caagctaaat tgaccgccta tcaaacagag 960 

gctaaagcgg cctatgaagc agctgtagca 102 0 

gctgaaaata ctgcaattaa gcaacgcaat 10 80 

ctcaagcaat atgaggccga tttggcagcg 1140 

gactatcaag ctaaattgac cgcctatcaa 12 00 

gcggatgcta aagcggccta tgaagcagct 12 60 

ctcacagctg aaaatactgc aattaaaaag 1330 

gcaaaacttg ctaagtatca agcagatctt 13 8 0 

ccagttaagt taaaggcata cgaagatgaa 1440 

cttgaaaaac ataaaaatga agacggaaac 1500 

tatgatcttg agccaaatgc gaacttatct 1560 

tctgctgtgg atgatgcttt tagcaaaagc 1620 

cttcaattag atgatctaga tatcactaac 1680 

atggagcttt atgggaattt tggtgataaa 1740 

tcacaggtta aatggggatc ggtactttta 180 0 

actaacctgc agaattctta ttacaatggt 1860 

acagtggacc ctaagtccaa gtttcaaggt 1920 

ccaactttag gtgtttttgc ttccgcttat 1980 

tttattaaaa atgaattcac tttctatgac 2 040 

gcccttctat cagtagcttc tcttaaccgt 2100 

tatacgggta aatttgtcaa aatctctgga 2160 

tatgctacag atactctcaa ctttaggcag 222 0 

accagagcta gcgaaccggg atctggctgg 22 8 0 

ggtgctggtg ctatccgcat gtctggtcct 2340 

tcaacacttg. ttgtgcctgc tgatcctaca 2400 

atttggtatt ctttaaatgg taaaatccgt 2460 

aaacccacac cgccggttaa accaacagct 2520 

ccattaaaac cggcaccagt agctccaaat 2580 

acaccggatc aagcagagcc aaacaaaccc 2640 

ttggagccag cacctgttga gccaagctat 27 00 

ccggatcagg cagagccaaa taaacccaca 2760 

gagccagcac ctgttgagcc aagctatgaa . 2820 

gatcaaccag aaccaaacaa acctgttgag 28 80 

actgatcctg tttatcaaga tcttccaaca 2940 

tactttaaac tagctgttca gccgcaggtt 3 0 00 
aatattgaca gaactttggt ggctaaacaa 3 060 
gatctccctg ctggacgtga tgaaacaact 3120 
ggttatcaat ttaatcctga agctacaaaa 3180 
gataatgcaa ctaatacagt caccttcaag 324 0 
gctgatttga ctaagtcagt ggcaacgatt 3300 
gatggcgcaa cttataagaa taatttcacg 33 60 
tccaatgttg ttcgggtgac aactcctggt 3420 
aattatatta aaccaactaa ggttaataaa 3480 
acagttcttg ccggttcaac gaattattat 3 540 
aacgaccgct cttcagcaga taccattcaa 3 600 
gaagaagcgc ttgaattgcg tcaggattta 3 660 
gttactggtg ttagtgtgga taattatact 3 720 
gatgttcttt ctaaggcagg aattagacct 3780 
aatccaagag aattttatga tacttatgtc 3 840 
ccaatggttg ttaaaaaaca aatgggacaa 3900 
caaattgact ttggtaatgg ttatgcatca 3 960 
aaccctaaga aagatgtgac cttaacactt 402 0 
cagactattc cacttaatac agtctttaat 4080 
gatcactcag aagaactctt tgaatacaat 4140 
cactatactg gtcagtataa agtttttgct 4200 
attatcaagt caggtgctga gttaactcag 426 0 
ggtgctatca caattaagtt caaggaagcc 4320 
ttccaagctg aaagttatat ccaaatgaaa 4380 
tatattaata ctgtcaatgg ggtaacttac 4440 
gaggatccta cagaccctac tgatccgcaa 4500 
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gatccatcat caccgcggac ttcaactgta attaactaca aacctcaatc aactgcttat 4560 
caaccaagct ctgttcaaga aacattacca aatacgggag taacaaacaa tgcttatatg 4620 
cctttacttg gtattattgg cttagttact agttttagtt tgcttggctt aaaggctaag 468 0 
aaagattga 4689 
5 <212> Type : DNA 

<211> Length : 4689 

SequenceName : SEQ ID 594 
SequenceDescription : 

10 Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<4 00> PreSequenceString : 

atgttaacgg aattaaaagc ggttttaaaa aagcctatgc tttggattac gafcggtagga , , 60 

15 gtagcccttg tccctgccct atataatatt atttttttga gttctatgtg ggatccttat 120 

ggcaaagtat ctgatttacc tgttgcagtt gttaataaag ataaaacggc aacttatgaa 180 

ggaaagaaga tgactatcgg taaagatatg actgataata tggtccgtaa taaaagtttg 240 

gactatcatt ttgttgatag cgaaaaagct caaaagggac ttgaaaaagg tgattactat 300 

atgatcatta ctttaccaga agatctttct caaaatgcgg ctagtgtttt aacagatgaa 360 

20 cctaaaaagc taacgattcc ttaccaaacg tctaaaggac atagttttgt tgcctccaaa 42 0 

atgagtgaaa ctgctgctaa gactttaaaa gagtctgtgt cgaaaaacat tacaagttct 480 

tacaccaaat cactttttaa gaatatgtca accctaaaaa caggtcttgg cagtgcggct 540 

aatgcaagtc aaaaaatagc gactggttca aaacagttag caaatggcag tcaagtgatg 600 

actgataatc tgaatttact ttcaaattca agtcaatcat ttgctcaagg gactaatacc 6 60 

25 ttatattcgg gcttaacagc ttatacggga ggtgtcggtc agctttctgc aggtttaaat 720 

aatttaaaca atggtttgac agcttataca aatggagttg gtcagttagc aaatggcagc 780 

agccaactga gcaatcaatc tcagaagctt ctaggaggtg ttgcgcaatt agcgaatggt 840 

tctgcttcta ttcaacaatt ggttaatgct agcagtcagt tgaatcaagg acttattaag 90 0 

ctgtcaacag caacaggtct ttctgaagaa caagttcaac agtttagctc attgattaat 960 

30 cagttgggaa ctttgaatca aagtattcaa aattacagtg ataatgggac agcaacgact 1020 

gcaaatagtc ctgatcttag tacgtattta tctgccatta caacagcagc tcaagcaatt 1080 

gttaattcag gaaatacgtc tcagcaaaca acaactaatc agtccaatgc attggctgca 1140 

gtccaagcta caggtgctta tcaaagatta tccgctgagg atcaatcaga gatcgctgca 12 00 

gctttggcca acactggtag ttcaacaaca actacaggtg ctgatgcaaa cgcagtgtca 1260 

35 caagcccaag ctatcctaaa caatgtccaa agtattcaaa gtgccttatc tactttacag 1320 

acaacaactg ctaatacacc aacaagccca tcagccagct tgactcaaat taaaaataca 13 8 0 

gctaattctg tattacctag tgcggcaact tccttgacaa ccttgtcaag tggtttgaca 144 0 

caagcaaaaa cagctcttga ttcgcaagta gtccctgtca gcacagccct tgctaatggt 1500 

acggctcaat taggctctac tttttcaaca ggcgccaatt ccttgatgac aggagtaggc * 1560 

40 caatacacta atgcagttga tattcttaat gcaggagcta atactttggc tgctaaaaat 162 0 

aaccagttaa cagatggtac aagtcaattg gtaaatggtg ccaatcaatt aaatagcaac 1680 

tctggacaat taacgaaagg gactgcacag ttagcaaatg gtgctaatca aatagagaca 1740 

ggagctggca aattggctgc aggtggagaa agtttaacgg cgggcttgac aactttgtca 18 00 

agtggtagtg gtgaactgtc gaaagcattg tcaactgcta aaaataaatt atcattagta 1860 

45 gcagttgaca atgataatgc taagactttg tcaagtcctg tcactatcaa gcatacagat 192 0 

aaagataatg tcaaaacaaa tggtgttgga atggctcctt atatgatgtc agcagcttta 198 0 

atggtaatgg caatttcaac caatactatc tttagagtag cactttctgg taagcaagct 2040 

aaaaccctac gagaatggat agatcaaaag ttagcagtca atggcttgat tgctgttact 2100 

ggagctatta ttctctattt tggtgttcat attattggtt tatcagctaa ttttgaacta 2160 

50 aaaactttag gattaattat tcttaccagt atcactttta tggtcttagt gacaactttg 222 0 

gtaacttggc atgataaatt tggctctttt gctgccttaa ttttactttt actacaatta 2280 

ggttccagtg caggaaccta tcccttagcc gttacagata agtttttcca agttgtcaat 2340 

ccttatttac caatgagtta ttcggtttct ggtttacgag aaaccatttc tatggctggt 240 0 

acaattggta atcaactact agcgctgagt ttatttttcc ttacttttgc tgctttagga 2460 

55 ttgttaatcg ctcgaaggag gattaggtct gtcaaagtag cctaa 2505 
<212> Type : DNA 
<211> Length : 2505 

SequenceName : SEQ ID 595 
SequenceDescription : 

60 

Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 
65 atggtatccc aaaaaaataa atctaaaaag ggtcaatcta aaacgtttac cttaatttca 60 
aatagaatta atctcctatt ttttttgatt gtcgctttgt ttactgtttt gcttttgagg 120 
ttagctcaga tgcagcttta tgatgcaaag ttttacaaat ctaaattgac agagtcaaca 180 
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acatatacta taaaaacctc cagccctcgc ggacaaattt atgacgctaa aggggtggct 24 0 

ttagttgaaa acgaggttaa agaggttgtt gcctttacaa gaagcaatac catgactgcc 300 

aaggatatta aagcaaatgc aaaaaagtta gcagatatgg tgactttaac tgaatctaag 3 60 

gtaaccaaac gtcagaaaaa agattactat ctggcagatc caaaaaatta tcaaaaaatc 420 

5 gttaaaaaat taccaaataa taaaaaatat gataactttg gaaataacct aacggaatcg 480 

aagatctatg caaatgctgt taaagcagtt ccaaacagtg ctattgatta ttctgaagat 540 

gagaaaaaaa tcattcatat tttcagccag atgaatgcaa cctctgtttt taatacagct 600 

tcgttaacaa caggagattt aacagctgag caaattgcag tattggctac aagcaaatca 660 

gatttaaagg gaatttctgt taagactgac tgggaacgta aaacagataa aaattccatt 72 0 

10 acttctatta tcggtaaagt ttccagtcaa aaaaccggtc tgcctgctga agaagccaat 780 

aactatgtta aaaaaggtta ttctctgaat gaccgcgttg gaacatctta tcttgaaaag 840 

caatatgaaa acgatttgca aggtagtcgt actgttcaag caatcaaggt taacaaagaa 900 

ggtaaaatta tcagtgataa gaccactgcc aaagggacta aaggaaaaaa tctgaagttg 960 

acccttgatc ttgaatttca aaaaggtgtt ,gaacaaattc ttaaccaata ttttaattct 1020 

15 gaattagcat ctggaaatac caagtattct gaaggcgttt atgctgttgt tcttaacccg 1080 

aacacaggtg cagttctttc tatggctggt ttggaacacg accttaaaac gggcgaagta 1140 

tcttctaatg ctcttggagc ggttactgaa gtctttactc cgggttccgt tgttaaagga 12 00 

gcgactttga cagctggttg ggaaaacggt gtcttatcag gcaatcaagt actcaatgac 1260 

cagcctattc aatttgcggg ttcaagccct attaactctt ggtttaccaa tggatcaact 1320 

20 cctcttacag caagtcaatc cttagagtat tcttctaata cttatatggt tcaattggct 13 80 

ttaaaattaa tgggacaaga ttatcatagt ggtatgacct tatcaacaga tggctafcaag 1440 

gaagctatgg agaaattaag agctacttat gctcaatatg gtttgggtgt ctcaacagga 1500 

attgaccttc cgggagagtc aaagggctat acgccagaac attatgatcc ttctaatgtt 1560 

ttaacagaat catttgggca gtttgataac tatacggcta tgcagctggc acaatatgct 1620 

25 gcagctgtcg caaatggtgg taaacgtatt gctccccatt tggttgaagg tatctatgat 1680 

aataataaaa caggcggttt aggaaatctt gttcaatcta ttgacaccaa ggttttaaat 1740 

aatgtttcta tctctagcga tgacatgggg attatcaagg aaggtttcta taatgttgtt 18 00 

aacggtggta gttatgcaac agggaagact cttgcaaaag gggcaagtgt tcccatttcg 1860 

gcaaaaacag ggacagccga agcttacgtg acaggagatg atggtaaatc tgtttataca 192 0 

30 tccaatttaa acgttgttgc ttatgcacca agcagcaatc ctcaaattgc tgtagctgtt 1980 

gtcttgccac atgagacaga ccttcatgga accactagtc atgctattac gagagatatc 2040 

atcaatcttt atcaaaagat gtatccaatg aatcagtga 2079 
<212> Type : DMA 
<211> Length : 2079 

35 SequenceName : SEQ ID 596 

SequenceDe script ion : 

Sequence 



40 <213> OrganismName : Streptococcus mutans UA159 
<400> Pre Sequenc eSt ring : 

atgacagttc taaaatatgg actaggtatt ctcttaagcg ctattatttt agccattata 60 

attggaggtc ttctgtttac ctattatgtc agcagtactc ctaaactatc agaagctaaa 120 

cttaaagcta ctaattctag tttggtttat gatagcaata ataatctgat tgctgattta 180 

45 ggtgctgaaa agcgcgaaag tatttcttca gacagtattc caatgaagtt agtaaatgcc 240 

gttacctcta ttgaagatca ccgtttcttt aaacatcgtg gtgtcgacat ttatcgtatt 300 

attggtgcag cttggagtaa tttacttcat aaatcaactc aagggggatc cactcttgat 360 

cagcagctta tcaagctggc ctatttctct actaaagagt ctgatcagac cttaaaacgt 420 

aaagctcaag aagtttggct gtctctacaa atggagaaaa aatacacgaa agaagagatt 480 

50 ctaacttttt atgtcaataa ggtttacatg ggtaatggga attacggaat gcgcactgct 540 

gcaaagtctt attatggcaa ggatcttaaa gacttatcaa ttgcccagct agcgacactc 600 

gcaggtattc cgcaagcacc gacacaatat gatccttacg ctcagccaaa ggcagctaca 660 

agcagacgta ataccgtttt gtcacagatg tataaacata aaaaaattac aaaacgagaa 72 0 

tatgatgctg cagtagcaac accaatttct gatggcctgc aagaactgaa acgctcctct 780 

55 agttatccaa aatatatgga taattatctg aaacaggtta tttcagaggt gaaaaaacgt 840 

actggtcaag atatcttttc agcaggcatg aaggtttata caaatgttaa tgccgatgca 900 

cagcaatatc tctggaacat ttataataca gatgaatata ttgcttatcc tgatgataat 960 

ttccaagttg cttctactgt tatggatgtt actaatggta aagttattgc acagcttggc 102 0 

ggacgccatc aagataccaa tgtttctttt ggtaccaatc aggctgtctt aactgatcgt 108 0 

60 gactggggat caaccatgaa acctatttca gcatatggcc ctgctcttga aagcgaagct 1140 

tttacgacaa ctgcacagat gctaaatgac tcggtctatt attatccagg tacaacaaca 120 0 

caagtctatg actgggatca tcgttataat. ggttggatga ctatccaaac ggctatccaa 1260 

caatctcgta atgtccctgc tgtcagagct attgatgccg ctggattaga tactgccaaa 132 0 

ggtttcttaa gcggtctcgg tattgattat cctgagatgc gttattcaaa cgccatttca 138 0 

65 agtaatacaa gtagttcaga acaaaagtat ggtgccagca gtgaaaaaat ggccgccgct 1440 

tatgctgctt tttctaatgg tggaacttat tatgaaccac aatacgtcaa taaaatagaa 1500 

tttaaggatg gaacatcaga gacctatgat gctaaaggca atcgtgcgat gaaagaaacg 1560 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



60 



65 



acagcctaca tgatgacaga tatgttaaaa 
gctgctattc ctggtcttta tcaagcaggt 
gaattggtag agatgtctga aaaacttggt 
ccagatgaaa actttgttgg ttatacacct 
aaaaatcgct taatgccfcgt ttacggagac 
actatgatgg cttatctttc tagctcaggt 
tatcgcagcg gtggttatct ttacctaaat 
gcagctcctg caacttcatc gtcatcttct 
gatcaaaata ataatcaaac tacagaagcg 
cttctaatcc 
: DNA 



acagtattaa 
aaaacaggaa 
atfcaatcctt 
cagtattcaa 
agtatgaaaa 
aattctgatt 
ggttcaagtg 
tcatcatctt 
tctagtgact 



catatggtac 
catccaacta 
atggacttgg 
tggctgtttg 
ttgctgcgca 
ggaccatgcc 
ggtcaaatag 
cttctgatfcc 
catcttcatc 



tggtactgag 
tgatgacaat 
tactattgct 
gacaggatat 
agtctatcgt 
tgacggtctc 
taggtatggt 
aaacaataac 
aagttctgat 



ataa 



gctacgacat 

<212> Type : 

<211> Length : 2124 

SequenceName : SEQ ID 597 
SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgaaatcga aaactgctaa aattactttg ctaagcagcc ttgctttggc 
gcaacgaatg tttttgcaga tgaagcatca actcaattaa attctgatac 
cctactgctg atacacaagc atcagaaccg gctgcaacag aaaaagaaca 
gtagctgttg tcgaaagtca cacacaagga aatacaacaa cgacaacatc 
tctaaagaat tggaagatgc taaggctaat gctaatcagg aaggtttaga 
actgaagctc aaaaacagcc ttcggtagaa gctgcagatg cagataacaa 
caaacaatta atacagcggt agctgattat caaaaggcaa aagctgaatt 
caagaacaat ataataaaga ttttgaaaag 
caaaaggcag cttacgagca atataaaaaa 
gttgaaaaag cccaaggact tgtgtttatt 
ggtgttaatc agtacctaac aaaagaagca 
cagcaatata atactgataa ttatacagct 
ccaaaagaag atacttggtt caaaatgaaa gtgggagatc agatttcagt 
aatatcgtta attcaaaata taatgataaa aagattagta aggtaaagat 
ctcaatagtt caacgaataa tgaaggcagt 
aagacaattt tcattggtgc acagacatct 

tgatgaaaat 
gttaaaccat 
tgaattcgtt 
taaggacaac 
ggatgctgtc 
tatgacttac 
aacaaccatt 



tatcagtctg atgtcaagga 
gaagttgcac aaggtttggc 
aatgaacctg aggcaaaact 
cgtcaaaaac atgcaactga 
tctgatttta cccaagcaaa 



acgatgcaaa ttatttttta 
gccattatga gtctctcatc 
aaagtaaacc ttggggataa 
ggcaatgaaa tctattcggc 
ggtgatggag cagatggctg 
gcttattatg gtgcaggtgc 
ggtaatgatc aaaacttacc 
cctaaagatc cgggagctaa 
gtgacttggc ataaaaatct 
ccaccaacaa ctcctgatga 
tcacctgtcg tagctaagtc 
agagagcgtg attatcaacc 
gctactcttg gtgctatttc 
aaagaaaact ag 
<212> Type : DNA 
<211> Length : 1752 

SequenceName : SEQ ID 
SequenceDescription : 



gcactggtca atttgttcca 
aatgctggca gaaatgataa 
ggcaatgaaa tcgatttaag 
tggacgacta agtatggcga 
aaaataccgg gctcatctgt 
caatataaag ctaatggtgc 
aatgctgatg gaacgccacg 
aagggagaac ccttcacctt 
tggtttgcga ctaattcagc 
accaacaccg ccagaaaaac cagagttgaa 
tgttgttgaa actaaaactg aggaagttcc 
accaacgcca gaaaagccaa aaacaccaga 
agtaagcttt agaacggcaa gaaaaggaga 
gactcttcca catgctgggg ctgctaaaca 
aactgcattt gctgcggcta ctttgattgc 



ggcttttgga 
tgttgcagca 
gtctcctgtt 
tcaagttact 
agtcactgaa 
agcacaggca 
tcctcaaaaa 
gtatgaagct 
atctgggcgt 
ttctattgag 
agatattctt 
tccatatgat 
tacctacgat 
taattatact 
tgatccaact 
aatcagtgtg 
cggcaataat 
tcatgtggaa 
tgacttacat 
aacctttaat 
tgctgcgacg 
tactgttggt 
tgtagctgtg 
aaaacctact 
tccagtgaca 
ggatcctcaa 
aatgcgtgtt 
aaatggttta 
agctagaaaa 



598 



55 Sequence 



<213> OrganismName : Streptococcus mutans TJA159 
<4 00> PreSequenceString : 

atggaacaga agatttttag caaacgaaaa agtaagattg ctgggctttg 
ttaacgacta cagttgttgc ccttgcgtca ggtactgtaa tcgaggctga 
gaacagcctg tcgcagctga gactgtctcg caagctgatg gggacaatcc 
acaagcgttc aacaagaaac tgctcctcaa caaacgaaaa cttctcaaag 
accgtagata gtgaagagtc agcaacttcc ccatctgatg aacagaccgt 
gattcaaact catcatctca aattgatcaa acgatagctg atacgaatcg 
gatcatattt caaaaacatc agccgctaca actgaagatc aagaagagaa 
gcaaaagcac aaactgctgc cgcaaccaac aatcaagaca ctcgttatag 
gcttatggca attccaattt taacaagaca ttaactgaat ttggaaaaaa 



tggagctatt 
tgagacaata 
cgaacaaaca 
ctcagacgca 
aagtcaaaat 
ctctgactct 
agttaattct 
tgcgaaagat 
tgctaatgtt 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040' 
2100 
2124 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1752 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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gctgatgtaa cctataatgg cgtgagggat gaatatattg tagttaacga tcctagtgct 600 

ccttacgttc ctaatgcaaa cgaaattgca aaatacttaa aggaatattt aacagaactc 660 

cgcaacatca ataatattgc tattccfcgtg ccttctgttg atcaggttat gcaaaaatac 72 0 

gcacaagatc gggctaacga agaagccaat gaaaaaaacg gcttggatca tgatactaat 78 0 

5 ttacctatcc ctaataattt aacttgggtt gccgaagatg gacatttgga tatggatagc 84 0 

agcattcaat ccaaaagtca agaaggctat acacttgctt ctgataaagc aaccgcctac 900 

tatctagcgc ttaactggtt ttctgactat tttaatattt acgatgaccc caacgatggc 960 

ctcaaatcgt ttggacacgc tgtcagtatt ttgtcagacg ggggaactgg aatgggctta 1020 

ggtcttgctt caggtcaaga taatgaaaag ggaatgtggt acgcacaatt ggaatttggt 10 80 

10 ggtaacgata acgaagataa taccaacgat ttttcatctt taaaaaacgg caagggagaa 1140 

tgggtattat attataaagg aagtcctgtt aagtttcttc ctaacactac cttttggtat 12 0 0 

gtaaaaaaag gcacttcccc tgatgcagct tctactcctc acaacagtga taaaccttca 12 60 

ttccagtcat ctaaagatct tgaccctaat ttcaaggccg ataatagatt ccaagaagga 1320 

aaggaagcct ctgttcatca^ggctattcct gcaacattta > aatctcatcg cgatgaagtt 1380.. 

15 ggtaataaag accaaaattc tctttctgct caactacctg atacaggagt tcaaaaaaat ' 1440 

aatcaattag ccttgatagc tttaggaaca ggcttgattt tactttccgg acttcttctt 15 00 

tcaaaaagaa aatccttaaa ataa 1524 
<212> Type : DNA 
<211> Length : 1524 

20 SequenceName : SEQ ID 599 

SequenceDescription : 

Sequence 



25 <213> Organi sraName : Streptococcus mutans UA159 
<4 00> PreSequenceString : 

atgacatttg aaaagcaaaa acactttagt ttacgtaaac taaaatttgg tttagtttca 60 

gttgcgatca tagctttctt atttgctgta acaaagactg cagaagctga cgagacagtg 12 0 

ataactgaac aaaggcaaac aagtaagatt aacgctagtt ctcaaaaagt ggagaatcag 180 

30 acttcaaatc aggtggaagc aaaaacggat agtgcaaaca aggatcctca agaaaaaaca 240 

ggaagtgttg caactgatgc cccttcaatg aattcagcta ataatatgag tcagtctgac 3 00 

aaacaaaata ctgttaatga aatatcttca gatagtcagc aaacaaaaac agatgaacaa 360 

actgatttac cgcaaaacag ctttaaacaa caatctgctc atgttaaaat gactactgaa 420 

gcagagaaga ccccatcaca ttcgattaat acctttgtta atgatggtaa tggtaattgg 480 

35 tattaccttg gtgctgatgg tagaaatgtt acaggcagtc atacgattgg tggcaagact 540 

atgtattttg ctcaagatgg taagcaagtt aaaggtgctt ttgctcaaga ttcggatgga 600 

aataaacatt attatgatag agatagtggt gagatgtgga ccaatcgctt tgtcaatgat 660 

caaggcaatt ggtattatct taataatgat ggtgtccctg tcaccggtag tattactgtc 720 

aatggtcagt ctctgtattt taattcggat ggtagtcagg ttaaagggaa ttttgttgaa 780 

40 gaagatggat ctttgcgtta ttatgataaa aattctggag atttactgag aaagacaagc 840 

cgaaccatta atggtgttaa ctaccaattt gataatgatg gaaatgcaag ggcgattgac 90 0 

aaaattgagg ttgttaagac cagtcttgta gttgatagtt atgaatttgg tccttctgtt 960 

tcaaagatta ttcttgagtt caatcataag gtaactcctg ctgttgttca tgctggtgca 1020 

atggtaacaa ctgctggagt tcaaagaaaa attcttaatt cttatgtctc taatgcttca 1080 

45 ggacatgtgg tttactttga tagtagccat tatgtgacac ttgaattaga tattccttat 114 0 

gatccaaatg atagcagccg aaatgcatca ccttttattt ttgactcagc agcctttcgt 1200 

aataactggg tcaacagtta tactgtcaaa gtagataatt tgcaggtgca agcagatggt 12 60 

tctaatagca gtcaaattat cagttcagag caagatgcta tcaataatcg tttcttgcct 132 0 

acaacggatc gtttctcaga acgtggtagt tatggtaatt ttaattatgc cgcctatcaa 1380 

50 ccagaagcag ctattggcgg tgagaagaat ccattgattg tctggttgca tggtatagga 1440 

gaagtaggca ctgatattaa tattccgctt ctagccagca atgtggctcg tttaacggaa 1500 

gatcctattc aaagccattt cacttctaca ggtagtggtg gtcaaaaggg agcctatgtg 1560 

ctagttcctc aaagttcaat tccttggtct caaaatcaaa cagctagctt aatggcgctc 162 0 

attaaagcct atgtagcaag ccatccagac atcgatagcc gacgcattta tttggcaggt 168 0 

55 gtttctaatg gtggcggtat gactctggat atgggagtcg cttatcctaa ctattttgca 1740 

gccttagttc ctattgctgc ttcctatagt aatcaattaa cagataatca gattaccgct 1800 

gctgctttga aagctctgaa aggtcaacca atgtggttga ttcatacacg aactgataaa 1860 

acaatatctg cagatagtag tgttctacca ttctataaag agttacttca agctggcgca 1920 

caaaataaat ggctttccta ttatgaaact aatgttggta aacatcactc tggagtcact 198 0 

60 tataacggtc actggtcttg gatttatttc ttgaatgatc aagtaactgg cactcaaaat 2 040 

actgataacg ccaagaattg gtctggactt tctggcatgg ttgcgaccaa tccaacctat 2100 

ggtggtgatg ctaaggctac tgtcaatggc agaacttata gtaatgtctt tgattggcta 2160 

aatggtcagc gaagaaggta a 2181 
<212> Type : DNA 

65 <211> Length : 2181 

SequenceName : SEQ ID 600 
SequenceDescription : 
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Sequence 



<213> Organ ismWame : Streptococci 
5 <40 0> PreSequenceString : 

atgaaaattt ttataaaaaa acaccaacaa 

ctaccaagtt ttataatgtt tctcgttcta 

acaactattt tagctagtga tggttttcat 

aatattctcc atggaacgga tagtttgttt 

10 atttttgctc tgacaagtta ttacttggga 

aatgtaaaaa atatggcaga tgctttttat 

ggtttatctg ctttttacag tcttgggcaa 

ttgatgctgt caacatctta tgctttaatg 

aattggttag atgtttttat cctgctacca 

15 gaaaaaaggg ggatttttct ttattttcta 

tattttggtt tcatgacagc tattttctta 

gatattagaa acagaatgaa acgattaagt 

ctgacaagtg cttttatgct gcttccaaca 

ttaacagaac aaattagtct attttcatca 

20 agtcttcttg gtagttatga tacgacaaaa 

ttacttccct tgatttttgc cattactttt 

aaagtagctt attttctttt attggctatt 

gatttatttt ggcaaggaat gcattcacct 

ttctccttag ttattgtcat aatggcagct 

25 ttgaaaaatt tttatccagc ctttaccttc 

ttcaaggact attataatta tctgacacaa 

gttagttatt ttattattct ttttactttt 

atttccttta cacttatctt tacaagtttt 

gaaggtattc aaactgactg gaatttccct 

30 gaaattgaca actatgttaa gaaaactaaa 

aaacaaattc cccaaactta caatgatggt 

ttctcatctg tcaaaaataa cttatcagca 

caaggaaatc attctaccat tagttatcct 

tcaattaaat acaatattaa taatcaaaat 

35 aaaaacaata agctgcaact ttacaaaaac 

aatcatattt acaaagatgt caagtttgac 

gttaatgaat tgacagatct aaatcttaca 

gtcggaatgc aagttttaga taatcgtgtt 

caagtttact atactgtaaa gtgtcctgca 

40 ttgacagtta ataataaaga cgaaaatgtc 

tatatcatag acgaaagtta ttatcttttt 

ttaatattta agcttagttt tccaaaaaat 

tatgctctcg atttaactgc ctatcaaaaa 

aaaacaacaa ctaagaaaaa taaaattttt 

45 ttgatttaca ctttaccata tgataaaggt 

aaaatatcta aagcacaaaa tggactaatg 

attataatga cttttgtgcc ccaaggacta 

atctttctct ttgtatttta ccaactttat 



is mutans UA159 

agtattcttt actatagtct tagttttctg 60 

ttctccaaaa atatttattg ggggagtagc 120 

caatatgtga tttttgatgc tctttttcgt 180 

tactctttta aggctgggct tggttttaat 240 

agttttttaa caccttttac ttactttttt 300 

ctcttcactt taatcaaatt tggtctaata 3 60 

atttatacta aaatctctaa atcactcgtt 420 

agctttacta gcagtcagtt agaattaaac 480 

cttattatgc ttggtttaca gcgtttagta 540 

actcttactt gtttatttat tcaaaattac 600 

actctttggt tttttacgca agtctcgtgg 660 

gattttgtgc tcgtatcaat ctttgcaacg 720 

tttcttgatt taaagagcca tggtgaagta 7 80 

gacatttggt atttcgattt ttttgctaaa 840 

tatggctcta ttccaacgat ttatatcggt 900 

ttctttgtta aatctataaa atggcaagtt 960 

attattgcaa gttttatctt tcaaccactt 1020 

aatatgtttt tgcatcgtta ttcttgggct 1080 

gaaacgttaa ctcggataaa ggatataaaa 1140 

ttgggagtag gacttttagc aactttttta 12 00 

gttaatttta tattaacaac tatcttttta 1260 

tttaatcaat tagtttctta taaagttatt 1320 

gaaatagctt taaatacttt ttatcaaatt 13 80 

tcaagagagg tttatgaaga taatgtaaag 1440 

aaagataact tagaattttt tcgaacagaa 1500 

atgaaattta attataatag catttctcag 1560 

caattattga attctctagg ctactattca 1620 

aataatacta ttttgatgga tagtcttttt 1680 

cctcataaat ttggattcca tttaaaacag 1740 

ttctattctc ttcctttagc acttatgtca 1800 

tcttatcccc ttgataatca acaaaaattt 1860 

cttttcaaag aaatccctat tatttcaagt 1920 

actattaatg gttcaaaagg aaataaggca 1980 

aatagtcaac tttatatcag ccttcctaac 2 040 

tttataacaa ctaacaagca cacaagttct 2100 

aatttaggaa attataaaaa aactcaaaca 2160 

aaaacggtta gttatgattt accacatatt 2220 

agtataaagc aattaaaaag tcaaactgtt 2280 

actacctatg ttgccaaaaa gagaacttcc 2340 

tggtttgcta aacaaaatgg aaaagcaatt 2400 

aaaattgatg tttctaaagg tagtgggaag 2460 

tatcaaggaa ttcttcttac ctgtctaggt 2520 

tacaaaaaat ttaatttaaa ataa 2574 



50 <212> Type : DNA 

<211> Length : 2574 

SequenceMame : SEQ ID 601 
SequenceDescription : 



55 Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<40 0> PreSequenceString : 

atgaaattga aacatatttt aagaattgga gcggttgctt ttgcctcaat tcttttgtta 60 

60 actgcttgcg gatcaaaaac atctaaaaaa acagtaaccc ttgcgactgt tggaacaaca 12 0 

aatccatttt cttatgagaa aaagggaaaa ttgacgggat atgatatcga agttgctaag 18 0 

gaagttttca aagcttctga taaatacgat gtcaaatatc aaaaaacaga gtggaccagc 240 

attttctctg gtctagatag tgacaaatat caaatcggag ctaacaatat cagttatact 3 00 

aaagagcgtg ccaataaata tctttattct aatccaacgg cttccaatcc attggtatta 3 60 

65 gtggttccaa aagatagtga tattaagtct tataacgata ttgctgggca tagcactcaa 42 0 

gttgttcaag gaaatacaac agtgtctatg ctgcagaaat tcaataaaaa ccatgaaaac 480 

aatcaagtta aactaaactt taccagtgaa gatcttgcgc atcaaatccg gaatgtcagt 540 
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10 



15 



20 



25 



30 



35 



gatggtaagt atgattttaa aatttttgaa aaaatttcag cagaaacgat catcaaagag 
caaggacttg ataatttgaa agttattgat cttccttcag accaaaaacc atatgtttac 
tttatttttg cgcaagacca aaaagactta caaaagtttg tcaataaacg tctcaaaaaa 
ctttacgaga atggtacact tgaaaaatta tcgaaaaaat accttggagg aagctatctt 
ccagataaaa aagatatgaa ataa 
<212> Type : DNA 
<211> Length : 804 

SequenceName : SEQ ID 602 

SequenceDescription : 

Sequence 



<213> OrganistnName : Streptococcus mutans UA159 
<400> PreSeque^iceString : 
atgcgatttc ttgtctttct catcgcattt 
gaacggattg attcaaatac agttgctgta 
ttaaaaacaa atcaattaaa tgggatcatg 
gtattttcaa atcaaagcaa ggtagatggc 
cttgcttctt tacaaaaatt gataacaggg 
aaactgtctt taaacacacc tttaagcaaa 
atcacgatac aaaatttact tacccacaca 
cagcaagtgc tgacaactca agagcagcaa 
acttatcgaa aaaaatggaa gtatgctaac 
agtcaaatta gcggtcaaaa ttatgcgact 
aaggggtggc attttaaaaa gtatattcaa 
tcagtgatgg atcaaagtac gacttgggat 
ggagctggtg attatgcttc taggccagtg 
aatgaccaat ttgttcctgt cagcgaatac 
tattatggcg gcctctatat cagccaaaag 
acttactctt gttttgctta ttcaaatcct 
acaaacggta agtataaacg ggtcaaatcc 
gattcgtatg cgctgaggaa aaatgaaacg 
<212> Type : DNA 
<211> Length : 1059 

SequenceName : SEQ ID 603 

SequenceDescription : 

Sequence 



tttgctgctt 
aaccctgatt 
attgtgacgg 
agtcctgttt 
gtggctatcc 
tattatcctc 
agcggtttgg 
ttggattttt 
attaattatg 
tatgttcgtc 
ataaaagata 
aagctgtcaa 
gattattgga 
caacgttcta 
atgctgcatg 
aaaaccaaac 
ttagcagcta 
tcaaaataa 



tctataaatt 
cgctcatttt 
ggccagatgg 
caattaagga 
aacaattaat 
aaattgaaaa 
cagatcgaaa 
cattgaccaa 
ctttgctagc 
aacacttctt 
agtccaagtt 
aagaagtgac 
aatttatgat 
tgaaaatgac 
caaatggtgg 
aggtcatggt 
aagcctttaa 



tatcgagact 
aaagcgattt 
taaggctcaa 
ttattttcct 
tgataaagga 
tagtgaaaat 
agaagttcct 
ttatcgcgta 
tggcattatc 
aacagctggt 
agctgccttg 
atctaccttt 
ggcttttatt 
ttctaagagc 
tggctttgat 
tttgtttatc 
actatatgca 



600 
660 
720 
780 
804 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1059 



40 



45 



50 



55 



60 



65 



<213> OrganismName : Streptococcus mutans UA159 
<40 0> PreSequenceString : 

atgaagaaaa aaatagctct agcagctctt tcttttgtca gtgcagctgt 
tgcagctcag cacctggtgg ttcatcagat gcagctggta ataaaattgg 
aaaattggtt acaatcttga attatcagga gatgtagccg cttatggaca 
aacggtgcta accttgctgt tgaagagatt aataaggcag gcggcattga 
attaaagtta tctcaaaaga taataaatct gataacggtg aagcatcaac 
aatcttgcta cccaaagtaa agtaaatgct atcttgggac cagcaacatc 
gcggctgctg ctcccaatgc caacgatgct gcagtaccac tcgtaacgcc 
caagataatt tgacctattc aaaaggcaaa gttcaagatt acatcttccg 
caagatagct tccaaggaaa gatcattgcc aaatatgcaa cagataattt 
aaagtagcgc tttactatga taagtcaagt gattacgccc aaggtattgc 
aaaaaagcat ataaagggaa gattactgtt gaagatacct ttcaagctaa 
ttccaagcag ctctgaccaa gtttaaaaat aaagactttg atgccattgt 
tattatactg aaactggtct gattacaaag caagcacgtg atatggggct 
atcttaggac ctgatggttt taatgatgaa aaatatgttg aaggtgctgg 
accaataatg ttcattatgt atctggttac tcaacaaaag ttgctttaac 
gaaaaattcc tgaaagatta taaggctaag tatggtgaag agccaaatat 
cttgcttatg attccgttta tatgattgct gatgctgcaa aagatgccaa 
gatattgcaa caaacctagc taaattgaaa aactttaaag gtgtgacagg 
attgataaga aacataaccc tgttaaatca gccgttatgg ttggtcttaa 
gaagacacag ctactgctgt tgaagcaaaa taa 
<212> Type : DNA 
<211> Length : 1173 

SequenceName : SEQ ID 604 

SequenceDescription : 



tcttgcagct 
agatactgta 
agctgaaaag 
tggcaaaaag 
aatctcaact 
tggtgctaca 
ttctggaaca 
tacaactttt 
gaaagctaaa 
tgatgcattc 
agaccaagat 
gataccaggt 
tacccagcct 
tgcagccaat 
aaataaggct 
gtttgccgct 
aacatctaag 
taaaatgaca 
agatggtaaa 



60 
120 
180 
240 
300 
360 
42 0 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1173 



Sequence 
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65 



55 <213> OrganismName : Streptococcus mutans TJA159 
<400> PreSequenceString -. 



60 



240 
300 
360 



<213> OrganismName : Streptococcus mutans UA159 
<4 00> PreSequenceString : 

atgaagaaat taagcttatt attactagtt tgtttatctt tattaggctt atttgcctgt 60 
5 acttctaaaa aaacagccga caaaaaattg actgttgtgg ctaccaattc tattattgct 120 
gatattacta agaatatcgc tggtaataag gttgtcttac atagtatcgt tcctgttggt 180 
cgagatcctc acgaatatga gcctcttcct gaagatgtta aaaagacctc tcaggctgat 
gtcatttttt ataatgggat taatcttgaa aatggaggca atgcttggtt taccaaacta 
gttaaaaatg ctcataaaaa gacagacaag gattattttg cagtgagcga tagtgttaag 

10 accatttatt tggaaaatgc aaaagaaaaa ggaaaggaag atcctcatgc ttggcttgac 420 
cttaaaaatg gtattattta tgctaaaaat atcatgaaac gtctatctga aaaagatcct 480 
aaaaacaaga gttattatca gaaaaatttt caagcctaca gcgccaaact tgaaaaacta 540 
cacaaagtag ccaaagaaaa aatcagtcgt atccctactg agaagaaaat gatcgtaact 60 0 

agngaaggtt gtttcaagta tttctctaag gcttacgata ttccttctgc ctatatatgg 660 

15 gaaattaata ccgaagaaga gggaacacca aatcaaatta aggctttagt gaaaaaatta 720 
aggaaaagtc gggtgtctgc gctttttgta gaaagcagtg ttgatgatcg tccaatgaaa 780 
actgtttcaa aagatacagg tatcccaatt gccgctaaaa tttttacaga ttcagttgct 84 0 

aaaaaaggac aggctggaga tagttactat gcgatgatga agtggaatat agataaaatt 900 
gcaaatggtc tgtcacaatg a 921 

20 <212> Type DNA 

<211> Length : 921 

SequenceNarae : SEQ ID 605 
SequenceDescription : 

25 Sequence 



60 
120 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgtttgttc atactaagac taagaaaaaa agaaagtggc aaaggaaagt gtttctactg 

30 ctgcttcttt ttttattgcc tattgtgtca gtattggctt ttattgtgtt atttattggc 

ggtggtacag ctgagtctca tgatgtggaa gcgacgacag ggggcgttaa gctttcagct 180 

aagcaatttg cagataagac aaagttagga atttcagaag aggaagctaa aaatgcctta 240 

gcttttgcgg ataggttgat gtctcgtcat cattttacag ctcaagcaac tgctggagta 3 00 

ttggctgttg gctttcgtga aagtggcttt gatgtcaaag cagttaataa ttctggtggt 3 60 

35 gtagctggct ttttccaatg gtctggctgg ggtagttctg ttaatggtga tcgttggaaa 420 

gtagctagta aaagagagtt aactctagag gttgaggtag atttgatgag cactgaacta 480 

gatggtcgat atgctgatgt tgtcaaaaaa gttggttctg cgactgatga aaaacaggct 540 

gctaaggatt ggtctcagta ttatgaaggt gttgcggtta gtgatggtca aacgaaagct 60 0 

gataaaattg agagttgggc aacaactatt tgtgaggctt taaagtctgg tggtacaaat 660 

40 tatgctaaag tgaataatac gggaacaagt tctactgcta tcccgcaggg ttgggaaaat 720 

attagtgctt ttgatggcca tgcttatgaa ggtagtgaaa attatcctca aggacaatgc 780 

acttggtatg tttataatcg tgctaaacag ttgggtgtta gcttcagtcc ttatatgggg 840 

aatggcggtc agtggtatca agtgcaaggc taccattcta gtcatacacc taaagcacat 900 

acggctttat cttttgtcaa tggtcaggca ggttctgatc caacttatgg tcatgttgct 960 

45 tttgtagagg ctgttaaaga tgatgggagt attctaatca gtgagatgaa cgtttatggt 1020 

caaccagcta tgacggttgc ctatcggaca tttgatgctg aaactgctaa acaattttgg 10 80 

tatgtagagg gaaaataa 1098 
<212> Type : DNA 
<211> Length : 1098 

50 SequenceName : SEQ ID 606 

SequenceDescription : 

Sequence 



ttactatttt 


attgggagct 


60 


attatacaaa 


tacggcttct 


120 


ctaaggctgt 


taattattgg 


180 


aagtcagtat 


taaggctggg 


240 


cgactgctct 


aactttcgat 


300 


ctgatcctaa 


tacaggagaa 


360 


atgataaggc 


tatgacatta 


420 


ctggtgttgt 


aaactttaac 


480 


atattacgcc 


aataagtcca 


540 


gtttgattca 


ctggaaagtt 


600 


tgactgatgt 


tgtcggagaa 


660 
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gatcaagaaa tcgtaaaaga ttccttggtt gctgcacgct tgcagtacat tgctggtgat 72 0 

gatgttgaca gtttagatga agctgcttcg cgaccttatg ctgaggattt ttcaaaaaat 780 

gttacttatc aaactaatga tttaggattg acaacaggat ttacctatac aattccagga 84 0 

tccagtaaca acgctatctt tatctcttat actactcgtt taacttcttc tcaatctgct 900 

5 ggtaaagatg tcagcaacac tattgctatt tcaggaaata atattaatta ttccaatcaa 960 

acaggctacg ctcgtattga atccgcatat ggtagagcta gttctagagt aaagaggcaa 102 0 

gcagaaacaa caactgttac tgaaacaaca acttcgtcat cttctgaaac gacaactagt 1080 

gaagcgacaa cagaaacaag tagtacaaca aataataatt caactactac agaaacagct 1140 

actagcacaa caggagcttc aacaacacaa acaaaaacga ctgcttctca aacgaatgtt 12 00 

10 ccgacaacaa caaacataac aacaacttca aaacaagtaa ccaagcaaaa agcgaaattt 12 60 

gttttaccat caacaggtga acaagcaggg cttttgttaa ctactgtagg tcttgtaatt 1320 

gttgctgtgg caggtgtcta tttctataga acacgtcgtt aa 1362 
<212> Type : DNA 
<211> .Length, ; 1362 . 

15 SequenceName : SEQ ID 607 

SequenceDe script ion : 

Sequence 



20 <213> OrganistnName : Streptococcus mutans TJA159 
<4 00> PreSequenceString : 

atgacattta aaaagttagt tttaggtttg ttgagttttg tggctgtatt tactttagta 60 

gcttgcagtt cttctaattc aaaaaattta caggatgata ttaaagaaaa gaaaaagtta 12 0 

gttgttgctg ttagtccgga ctatgctcct tttgagttca aggctcttgt gaacggtaag 180 

25 gatactgttg ttggtgctga tattgatttg gcaaaagcaa ttgctaaaga attgggagtg 240 

aaactggaat tatcttccat gagttttgat aatgtcttgt ccagtttaaa aacaggaaaa 300 

gcagacatag ctatctctgg tttatcttat accaaggaac gtgctcaagc ctatgacttt 3 60 

tcagaagctt attataaaac ggaaaatgct attcttatta aaaagtctga tttgaacaaa 420 

tatacaatga tttcttcttt taataataag actaaagtag ctgttcaaaa aggaacgatt 480 

30 gaagaaggat tagctaaaaa tcaattaaaa caatcaaaca ttacctcttt gacttcgatg 540 

ggcgaagctg ttaatgagct caaatctggt caggttgatg ctattgatct tgaaaaacca 600 

gtggcagaag gttatgtgtc tcaaaatagt gatttggttc ttgccaaagt tgccttaaaa 660 

acgggtgaag gggatgccaa agcagttgct ctgcctaaag acagtggtca attagttaag 72 0 

acggtgaata aggttattaa gaaactcaaa aaagaagata aatacaagca gtttatcagc 780 

35 gatgctgtta aattaactgg tcagcaagtg gattga 816 
<212> Type : DNA 
<211> Length : 816 

SequenceName : SEQ ID 608 
SequenceDe script ion : 

40 

Sequence 



<213> OrganistnName : Streptococcus mutans UA159 
<4 00> PreSequenceString : 

45 atgaaaaagc attttttcat gacttttagc ctcttgctag cggctgtttt tctagttgct 60 

tgttccaatc tttccgattc tggacagagg aattgggata agataaataa gagaggaatg 12 0 

cttaaaattg ctactgcagg aacgctttat ccgcaatctt atcatgatga tcataataaa 180 

ttgacgggtt atgatgttga aattctaaaa gaaataggaa aacgtttggg attgaaagtt 240 

cagtttactg aaatgggtgt cgatggtatg ctgacagcca tcaagagcgg tcagatcgat 300 

50 gttgctaatt attccctaga agacggcaac aaaaatatca gtaagttttt gagaacctct 360 

ccctataaat attcttttac gtcaatggtt gtccgctcta aagatgattc aggtattcat 420 

tcttggtcag accttaaggg aaaaaaagct gccggagctg ccagcactaa ttatatgaag 48 0 

attgctaaaa aattaggagc aaaattagtt gtctatgata atgtcaccaa cgatgtttat 540 

atgaaagatt tagttaatgg tcgtacagat gtcattatca atgattatta tctgcaaaag 600 

55 atagctgttg cagcagtcaa agacaaatac gctatcaaaa taaaccaagg actttatgcc 660 

aatccttaca gcactagttt tacattgtct ttgaaaaaca aagtactgca aaagaaaatc 720 

aataaggctg tgaaagacat gcgcaaggat ggcaccctaa ccaagctatc taagaagttt 780 

ttccaaggag aagacgtcac taaaaaacat tataatagct ataaaaaaat tgatatttct 840 

gacgttgatt aa 852 

60 <212> Type : DNA 

<211> Length : 852 

SequenceName : SEQ ID 609 
SequenceDescription : 

65 Sequence 



<213> Organi smName : Streptococcus pneumoniae R6 
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50 



Sequence 



<40 0> PreSequenceString : 

atgaagcttt tgaaaaaaat gatgcaagtc gcactagcca catttttctt cggtttgcta 60 

gggaccagta cagtatttgc agatgattct gaaggatggc agtttgtcca agaaaatggt 12 0 

agaacctact acaaaaaggg ggctctaaaa gaaacctact ggagagtgat agatgggaag 18 0 

5 tactattatt ttgatccttt atccggagag atggttgtcg gctggcaata tatacctgct 24 0 

ccacacaagg gggttacgat tggtccttct ccaagaatag agattgctct tagaccagat 3 00 

tggttttatt ttggtcaaga tggtgtctta caagaatttg ttggcaagca agttttagaa 36 0 

gcaaaaactg ctacgaatac caacaaacat catggggaag aatatgatag ccaagcagag 42 0 

aaacgagtct attattttga agatcagcgt agttatcata ctttaaaaac tggttggatt 48 0 

10 tatgaagagg gttattggta ttatttacag aaggatggtg gctttgattc tcgcatcaac 54 0 

agattgacgg ttggagagct agcacgtggt tgggttaagg attaccctct tacgtatgat 60 0 

gaagagaagc taaaagcagc tccatggtac tatctagatc cagcaactgg ctggcaaaac 660 

cttgggaaca aatggtacta tctccgttca tcaggagcta tggcaactgg ttggtatcag 72 0 

.gaaggttcga cttggtacta tctaaatgca . agtaatggag atatgaaaac aggctggttc 780 

15 caagtcaatg gtaactggta ctatgcctat gattcaggtg ctttagctgt taataccaca 840 

gtaggtggtt actacttaaa ctataatggt gaatgggtta agtaa 885 
<212> Type : DNA 
<211> Length : 885 

SequenceName : SEQ ID 610 

20 SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 

25 <4 00> PreSequenceString : 

atgaaacttt tgaaaaaaat gatgcaagtt ctactagcag tctttttctt tggtttgcta 60 

gctacaaata cggtatttgc gaataccaca ggtggccgat ttgttgataa ggataataga 120 

aaatattatg taaaagatga tcataaagca atctattggc ataaaataga cggtaaaact 180 

tactattttg gtgatattgg agagatggtt gtcggttggc aatacttaga aattcctgga 240 

30 acaggttatc gtgataattt attcgataac caaccagtta atgaaattgg ccttcaggag 300 

aagtggtact attttggaca agatggtgct ttgctagaac aaacagataa acaagtacta 3 60 

gaggcaaaaa cgtctgaaaa tacaggaaaa gtatacggtg aacaatatcc tctatctgct 420 

gaaaagagaa cttattattt tgataataat tatgctgtaa agacaggctg gatttatgaa 48 0 

gacggcaatt ggtattattt aaataagcta ggaaattttg gcgatgattc ttacaatcca 54 0 

35 ctaccaattg gtgaagttgc taagggttgg actcaagatt ttcatgttac tattgacatt 60 0 

gatagaagca aacctgctcc atggtactac ctagatgctt caggtaagat gcttacagat 660 

tggcaaaaag taaacggaaa atggtattat tttggctcct ctggttctat ggcaacaggt 720 

tggaaatatg tacgaggcaa atggtattac ttagataata aaaatggtga tatgaaaaca 78 0 

ggatggcaat accttggtaa caagtggtac tacctccgtt catcaggagc tatggtaact 84 0 

40 ggctggtatc aagatggttt aacttggtac tacctaaatg caggtaatgg agacatgaag 900 

acaggttggt tccaggtcaa tggcaaatgg tactatgctt atagctcagg tgccttggca 960 

gtgaatacga ccgtagatgg ctattctgtc aactataatg gcgaatgggt tcaataa 1017 

<212> Type : DNA 
45 <211> Length : 1017 

SequenceName : SEQ ID 611 
SequenceDescription : 



60 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atgaataaga aaaaaatgat tttaacaagt ctagccagcg tcgctatctt aggggctggt 

tttgttgcgt ctcagcctac tgttgtaaga gcagaagaat ctcccgtagc cagtcagtct 12 0 

55 aaagctgaga aagactatga tgcagcgaag aaagatgcta agaatgcgaa aaaagcagta 18 0 

gaagatgctc aaaaggcttt agatgatgca aaagctgctc agaaaaaata tgacgaggat 240 

cagaagaaaa ctgaggagaa agccgcgcta gaaaaagcag cgtctgaaga gatggataag 3 00 

gcagtggcag cagttcaaca agcgtatcta gcctatcaac aagctacaga caaagccgca 3 60 

aaagacgcag cagataagat gatagatgaa gctaagaaac gcgaagaaga ggcaaaaact 42 0 

60 aaatttaata ctgttcgagc aatggtagtt cctgagccag agcagttggc tgagactaag 480 

aaaaaatcag aagaagctaa acaaaaagca ccagaactta ctaaaaaact agaagaagct 54 0 

aaagcaaaat tagaagaggc tgagaaaaaa gctactgaag ccaaacaaaa agtggatgct 600 

gaagaagtcg ctcctcaagc taaaatcgct gaattggaaa atcaagttca tagactagaa 660 

caagagctca aagagattga tgagtctgaa tcagaagatt atgctaaaga aggtttccgt 720 

65 gctcctcttc aatctaaatt ggatgccaaa aaagctaaac tatcaaaact tgaagagtta 780 

agtgataaga ttgatgagtt agacgctgaa attgcaaaac ttgaagatca acttaaagct 840 

gctgaagaaa acaataatgt agaagactac tttaaagaag gtttagagaa aactattgct 900 
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10 



15 



gctaaaaaag 
gaaaaaccag 
ccagcgccgg 
aaaccagaaa 
gaatataatc 
aaaacaggct 
gcgacaggat 
gctacaggtt 
gcaacaggtt 
gctacaggtt 
gcaacaggtt 
gctacaggtt 
gctacaggtt 
gcaacaggtt 
aaagcaagcc 
cttgcagtca 



ctgaattaga 
ctccagctcc 
ctcctcaacc 
aaacagatga 
gcttgactca 
ggaaacaaga 
ggctccaaaa 
ggctccaata 
gggctaaagt 
ggctccaata 
gggctaaagt 
ggctccaata 
gggctaaagt 
gggtgaaaga 
aatggttcaa 
acacaactgt 



aaaaactgaa 
agaaactcca 
agctcccgca 
tcaacaagct 
acagcaaccg 
aaacggtatg 
caacggttca 
caatggttca 
caacggttca 
caacggttca 
caacggttca 
caacggttca 
caacggttca 
tggagatacc 
agtatcagat 
agatggctat 



gctgacctta 
gccccagaag 
ccaaaaccag 
gaagaagact 
ccaaaagctg 
tggtacttct 
tggtactacc 
tggtattacc 
tggtactacc 
tggtattacc 
tggtactacc 
tggtactacc 
tggtactacc 
tggtactatc 
aaatggtact 
aaagtcaatg 



agaaagcagt 
caccagctga 
agaagccagc 
atgctcgtag 
aaaaaccagc 
acaatactga 
tcaacagcaa 
tcaacgctaa 
tcaacgctaa 
tcaacgctaa 
tcaacgctaa 
tcaacgctaa 
tcaacgctaa 
ttgaagcatc 
atgtcaatgg 
ccaatggtga 



taatgagcca 

acaaccaaaa 

tgaacaacca 

atcagaagaa 

tcctgcacca 

tggttcaatg 

tggtgctatg 

cggtgctatg 

tggtgctatg 

cggcgctatg 

tggtgctatg 

cggtgctatg 

tggtgctatg 

aggtgctatg. 

tttaggtgcc 

atgggtttaa 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



<212> Type : DNA 
<211> Length : I860 
20 SequenceName : SEQ ID 612 

SequenceDescription : 



Sequence 



25 <213> Organ ismtsTame : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atgaaaattt taccgtttat agcaagagga acaagttatt acttgaagat gtcagttaaa 60 

aagcttgttc cttttttagt agtaggattg atgctagcag ctggtgatag tgtctatgcc 12 0 

tattccagag gaaatggatc gattgcgcgt ggggatgatt atcctgctta ttataaaaat 180 

30 gggagccagg agattgatca gtggcgcatg tattctcgtc agtgtacttc ttttgtagcc 240 

tttcgtttga gtaatgtcaa tggttttgaa attccggcag cttatggaaa tgcgaatgaa 3 00 

tggggacatc gtgctcgtcg ggaaggttat cgtgtagata atacaccgac gattggttcc 3 60 

attacttggt ctactgcagg aacttatggt catgttgcct gggtgtcaaa tgtaatggga 420 

gatcagattg agattgagga atataactat ggttatacag aatcctataa taaacgagtt 480- 

35 ataaaagcaa acacgatgac aggatttatt cattttaaag atttggatag tggcagtgtt 540 

gggaatagtc aatcctcagc ttcaacaggc ggaactcatt attttaagac caagtctgct 60 0 

attaaaactg aacccctagt tagtgcaact gtgattgatt actattatcc tggggagaag 660 

gttcattatg atcagatact tgaaaaagac ggctataagt ggttgagtta tactgcctat 720 

aatggaagct atcgttatgt tcaattggag gctgtgaata aaaatcctct aggtaattct 780 

40 gttctttctt caacaggagg aactcattat tttaagatca agtctgctat taaaactgaa 840 

cccctagtta gtgcaactgt gattgattac tattatcctg gagagaaggt tcattatgat 900 

cagatacttg aaaaagacgg ctataagtgg ttgagttata cggcttataa cggaagtcgt 960 

cgctatatac agctagaggg agtgacttct tcacaaaatt atcagaatca atcaggaaat 102 0 

atctctagct atggatccaa taatagttca actgtcggtt ggaagaaaat aaatggtagt 1080 

45 tggtatcatt tcaaatcaaa tggttctaaa tcaacaggat ggctgaaaga cggttctagc 114 0 

tggtattatt tgaaattatc tggtgaaatg cagacaggat ggttaaagga gaatggctcg 1200 

tggtattatc tgggtagttc aggggcaatg aaaacaggct ggtaccaggt ctctggtgag 1260 

tggtattatt cttactcttc aggcgcctta gctattaata cgacggtgga tggctacaga 132 0 

gtaaacagtg atggagaacg agtatag 1347 

50 <212> Type : DNA 

<211> Length : 1347 

SequenceName : SEQ ID 613 
SequenceDescription : 

55 Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atgtttgcat caaaaagcga aagaaaagta cattattcaa ttcgtaaatt tagtattgga 60 

60 gtagctagtg tagctgttgc cagtcttgtt atgggaagtg tggttcatgc gacagagaac 120 

gagggaagta cccaagcagc cacttcttct aatatggcaa agacagaaca taggaaagct 18 0 

gctaaacaag tcgtcgatga atatatagaa aaaatgttga gggagattca actagataga 240 

agaaaacata cccaaaatgt cgccttaaac ataaagttga gcgcaattaa aacgaagtat 300 

ttgcgtgaat taaatgtttt agaagagaag tcgaaagatg agttgccgtc agaaataaaa 360 

65 gcaaagttag acgcagcttt tgagaagttt aaaaaagata cattgaaacc aggagaaaag 42 0 

gtagcagaag ctaagaagaa ggttgaagaa gctaagaaaa aagccgagga tcaaaaagaa 480 

gaagatcgtc gtaactaccc aaccaatact tacaaaacgc ttgaacttga aattgctgag 540 
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ttcgatgtga aagttaaaga agcggagctt gaactagtaa aagaggaagc taaagaatct 600 

cgaaacgagg gcacaattaa gcaagcaaaa gagaaagttg agagtaaaaa agctgaggcfc 660 

acaaggttag aaaacatcaa gacagatcgt aaaaaagcag aagaagaagc taaacgaaaa 72 0 

gcagatgcta agttgaagga agctaatgta gcgacttcag atcaaggtaa accaaagggg 780 

5 cgggcaaaac gaggagttcc tggagagcta gcaacacctg ataaaaaaga aaatgatgcg 840 

aagtcttcag attctagcgt aggtgaagaa actcttccaa gctcatccct gaaatcagga 900 

aaaaaggtag cagaagctga gaagaaggtt gaagaagctg agaaaaaagc caaggatcaa 960 

aaagaagaag atcgccgtaa ctacccaacc aatacttaca aaacgcttga ccttgaaatt 102 0 

gctgagtccg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaag 10 80 

10 gaacctcgag acgaggaaaa aattaagcaa gcaaaagcga aagttgagag taaaaaagct 1140 

gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 1200 

cgaaaagcag cagaagaaga taaagttaaa gaaaaaccag ctgaacaacc acaaccagcg 12 60 

ccggctactc aaccagaaaa accagctcca aaaccagaga agccagctga acaaccaaaa 13 20 

gcagaaaaaa cagatgatca acaagctgaa gaagactatg ctcgtagato- agaagaagaa 1380 

15 tataatcgct tgactcaaca gcaaccgcca aaaactgaaa aaccagcaca accatctact 1440 

ccaaaaacag gctggaaaca agaaaacggt atgtggtact tctacaatac tgatggttca 15 00 

atggcaacag gatggctcca aaacaacggt tcatggtact atctaaacgc taatggtgct 1560 

atggcgacag gatggctcca aaacaatggt tcatggtact atctaaacgc taatggttca 1620 

atggcaacag gatggctcca aaacaatggt tcatggtact acctaaacgc taatggtgct 1680 

20 atggcgacag gatggctcca atacaatggt tcatggtact acctaaacag caatggcgct 1740 

atggcgacag gatggctcca atacaatggc tcatggtact acctcaacgc taatggtgat 18 00 

atggcgacag gatggctcca aaacaacggt tcatggtact acctcaacgc taatggtgat 18 60 

atggcgacag gatggctcca atacaacggt tcatggtatt acctcaacgc taatggtgat 192 0 

atggcgacag gttgggtgaa agatggagat acctggtact atcttgaagc atcaggtgct 1980 

25 atgaaagcaa gccaatggtt caaagtatca gataaatggt actatgtcaa tggctcaggt 2 04 0 

gcccttgcag tcaacacaac tgtagatggc tatggagtca atgccaatgg tgaatgggta 2100 

aactaa 2106 
<212> Type : DNA 
<211> Length : 2106 

30 SequenceName : SEQ ID 614 

SequenceDescription : 

Sequence 



35 <213> Organi STriName : Streptococcus pneumoniae R6 
<4 00> PreSequenceString : 

- atgaaaaaaa ctacaatatt atcattaact acagctgcgg ttattttagc agcatatgtc 60 

cctaatgaac caatcctagc agcatatgtc cctaatgaac caatcctagc agatactcct 120 

agttcggaag taatcaaaga gactaaagtt ggaagtatta ttcaacaaaa taatatcaaa 180 

40 tataaggttc taactgtaga aggtaacata ggaactgttc aagtgggtaa tggagttact 240 

cctgtagagt ttgaagctgg tcaagatgga aaaccattca cgattcctac aaaaatcaca 3 00 

gtaggtgata aagtatttac cgttactgaa gtagctagtc aagcttttag ttattatcca 360 

gatgaaacag gtagaattgt ctactatcct agctctatta ctatcccatc aagcataaaa 420 

aaaatacaaa aaaaaggctt ccatggaagt aaagctaaaa ctattatttt tgacaaaggc 4 80 

45 agtcagctgg agaaaattga agatagagct tttgattttt ctgaattaga agagattgaa 540 

ttgcctgcat ctctagaata tattggaaca agtgcatttt ctfcttagtca aaaattgaaa 600 

aagctaacct tttcctcaag ttcaaaatta gaattaatat cacatgaggc ttttgctaat 660 

ttatcaaatt tagagaaact aacattacca aaatcggtta aaacattagg aagtaatcta 72 0 

tttagactca ctactagctt aaaacatgtt gatgttgaag aaggaaatga atcgtttgcc 780 

50 tcagttgatg gtgttttgtt ttcaaaagat aaaactcaat taatttatta tccaagtcaa 840 

aaaaatgacg aaagttataa aacgcctaag gagacaaaag aacttgcatc atattcgttt 900 

aataaaaatt cttacttgaa aaaactcgaa ttgaatgaag gtttagaaaa aatcggtact 960 

tttgcatttg cggatgcgat taaacttgaa gaaattagct taccaaatag tttagaaact 1020 

attgaacgtt tagcctttta cggtaattta gaattaaaag aacttatatt accagataat 1080 

55 gttaaaaatt ttggtaaaca cgttatgaac ggtttaccaa aatttttaac attatctggt 1140 

aataatatca actcattgcc gtccttcttc ctaagtggcg tcttagattc attaaaggaa 1200 

attcatatta agaataaaag tacagagttt tctgtgaaaa aagatacatt tgcaattcct 12 60 

gaaactgtta agttctatgt aacatcagaa catataaaag atgttcttaa atcaaattta 13 20 

tctactagta atgatatcat tgttgaaaaa gtagataata taaaacaaga aactgatgta 13 8 0 

60 gctaaaccta aaaagaattc taatcaggga gtagttggtt gggttaaaga caaaggttta 1440 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggttta 1500 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggttta 1560 

tggtattact taaatgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1620 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1680 

65 tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1740 

tggtattact taaatgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 18 00 

tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 18 60 
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tggtattact taaacgaatc aggttcaatg gctactggtt gggttaaaga caaaggctta 1920 
tggtattact taaatgaatc aggttcaatg gctactggtt gggttaaagt ttctggtaaa 1980. 
tggtactata cctataattc aggagattta ttagtaaaca cgactacacc cgatggctat 2040 
cgagtcaatg ctaacggtga gtgggtagga tag 2073 
5 <212> Type : DNA 

<211> Length : 2073 

SequenceName : SEQ ID 615 

SequenceDe script ion : 

10 Sequence 



<213> OrganismMarae : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

atggaaatta ■ atgtgagtaa attaagaaca gatttgcctc aagtcggcgt gcaaccatat 60 

15 aggcaagtac acgcacactc aactgggaat ccgcattcaa ccgtacagaa tgaagcggat 120 

tatcactggc ggaaagaccc agaattaggt tttttctcgc acattgttgg gaacggttgc 180 

atcatgcagg taggacctgt tgataatggt gcctgggacg ttgggggcgg ttggaatgct 240 

gagacctatg cagcggttga actgattgaa agccattcaa ccaaagaaga gttcatgacg 3 00 

gactaccgcc tttatatcga actcttacgc aatctagcag atgaagcagg tttgccgaaa 360 

20 acgcttgata cagggagttt agctggaatt aaaacgcacg agtattgcac gaataaccaa 420 

ccaaacaacc actcagacca cgttgaccct tatccatatc ttgctaaatg gggcattagc 480 

cgtgagcagt ttaagcatga tattgagaac ggcttgacga ttgaaacagg ctggcagaag 540 

aatgacactg gctactggta cgtacattca gacggctctt atccaaaaga caagtttgag 600 

aaaatcaatg gcacttggta ctactttgac agttcaggct atatgcttgc agaccgctgg 660 

25 aggaagcaca cagacggcaa ctggtactgg ttcgacaact caggcgaaat ggctacaggc 72 0 

tggaagaaaa tcgctgataa gtggtactat ttcaacgaag aaggtgccat gaagacaggc 780 

tgggtcaagt acaaggacac ttggtactac ttagacgcta aagaaggcgc catggtatca 840 

aatgccttta tccagtcagc ggacggaaca ggctggtact acctcaaacc agacggaaca 900 

ctggcagaca ggccagaatt cacagtagag ccagatggct tgattacagt aaaataa 957 

30 

<212> Type : DNA 
<211> Length : 957 

SequenceName : SEQ ID 616 

SequenceDescription : 

35 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

40 atgactttcg cctattggtg tattctgatt gcctacctat tgccgctttt ttgtgcggcg 60 
tatgccaaaa aagcgggcgg attccggttt aaagacaacc acaatccgcg cgattttctg 120 
gcgcgcacgc aaggcacagc cgcccgtgcc cacgccgcgc agcaaaacgg ttttgaagcc 180 
tttgcaccgt ttgcagccgc cgttttgacg gcacacgcaa ccggcaatgc cggacaagca 240 
accgtcaaca cgcttgccgg cctgttcatc ctgttccgcc tcgcctttat ctggtgctac 3 00 

45 atcgcagaca aagcagcatt acgctcgctg atgtgggtgg gcggatttgt ctgcaccgtc 360 
gggctgtttg tcgtggctgc ttga 3 84 

<212> Type : DNA 
,<211> Length : 384 

SequenceName : SEQ ID 617 

50 SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 

55 <400> PreSequenceString : 

atgaacaaaa tataccgcat catttggaat agtgccctca atgcctgggt cgccgtatcc 60 

gagctcacac gcaaccacac caaacgcgcc tccgcaaccg tgaagaccgc cgtattggcg 120 

acactgttgt ttgcaacggt tcaggcgaat gctaccgatg aagatgaaga agaagagtta 180 

gaatccgtac aacgctctgt cgtagggagc attcaagcca gtatggaagg cagcggcgaa 240 

60 ttggaaacga tatcattatc aatgactaac gacagcaagg aatttgtaga cccatacata 300 

gtagttaccc tcaaagccgg cgacaacctg aaaatcaaac aaaacaccaa tgaaaacacc 360 

aatgccagta gcttcaccta ctcgctgaaa aaagacctca caggcctgat caatgttgaa 420 

actgaaaaat tatcgtttgg cgcaaacggc aagaaagtca acatcataag cgacaccaaa 480 

ggcttgaatt tcgcgaaaga aacggctggg acgaacggcg acaccacggt tcatctgaac 540 

65 ggtatcggtt cgactttgac cgatacgctt gcgggttctt ctgcttctca cgttgatgcg 600 

ggtaaccaaa gtacacatta cactcgtgca gcaagtatta aggatgtgtt gaatgcgggt 660 

tggaatatta agggtgttaa aactggctca acaactggtc aatcagaaaa tgtcgatttc 72 0 
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gtccgcactt acgacacagt cgagttcttg 
gtggaaagca aagacaacgg caagagaacc 
attaaagaaa aagacggtaa gttggttact 
acagacgaag gcgaaggctt agtgactgca 
5 ggttggagaa tgaaaacaac aaccgctaat 
accgttacat caggcacaaa tgtaaccttt 
agtaaagatg atcaaggcaa catcactgtt 
aacgtcaatc agctgcaaaa cagcggttgg 
tcgggcaaag tcatcagcgg caatgtttcg 

10 aacattaatg ccggcaacaa catcgagatt 
acttcgatgg cgccgcagtt ttccagcgtt 
ttaagcgtgg atgacgaggg cgcgttgaat 
cgcattacca atgtcgcccc gggcgttaaa 
aaaggcgtgg -cgcaaaactt gaacaaccgc 

15 ggcatcgccc aagcgattgc aaccgcaggt 
atgatggcga tcggcggcgg cacttatcgc 
agtatttccg acggcggaaa ttggattatc 
catttcggtg cttccgcatc tgtcggttat 
<212> Type : DNA 

20 <211> Length : 1779 

SeguenceName : SEQ ID 618 
SequenceDescription : 



agcgcagata cgaaaacaac gactgttaat 780 

gaagttaaaa tcggtgcgaa gacttctgtt 84 0 

ggtaaaggca aaggcgagaa tggttcttct 900 

aaagaagtga ttgatgcagt aaacaaggct 960 

ggtcaaacag gtcaagctga caagtttgaa 1020 

gctagtggta aaggtacaac tgcgactgta 10 8 0 

atgtatgatg taaatgtcgg cgatgcccta 114 0 

aatttggatt ccaaagcggt tgcaggttct 12 0 0 

ccgagcaagg gaaagatgga tgaaaccgtc 1260 

agccgcaacg gtaaaaatat cgacatcgcc 132 0 

tcgctcggcg cgggggcaga tgcgcccact 1380 

gtcggcagca aggatgccaa caaacccgtc 1440 

gagggggatg ttacaaacgt cgcacaactt 150 0 

atcgacaatg txjgacggcaa cgcgcgtgcg , 156£L 

ctggttcagg cgtatctgcc cggcaagagt 162 0 

ggcgaagccg gttacgccat cggctactcc 168 0 

aaaggcacgg cttccggcaa ttcgcgcggc 1740 

cagtggtaa 1779 



25 



Sequence 



<213> OrganisrtdsFame : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

attcttttgg ctgaaggtca aaaatcagcc gtcaccgagt attacctgaa tcacggcaca 60 
tggcccagca acaacagtga tgccggcgtg gcatccaccg ccaccgacat caaaggaaaa 120 
30 tatgttaaag aagttaaagt cgaaaaaggc gtcattaccg ccacaatgct ttcaagcggc 180 
gtaaacaacg aaatcaaagg caaaaaactc tccctgtggg ccaagcgtca agccggttcg 240 
gtaaaatggt tctgcggaca gccggttgag cgcgccgcca acaacgccgc caacgacgcc 3 00 

gtcaccgccg ccaccgccaa cggcaacggc aagatcgaca ccaaacacct gccgtcaacc 360 
tgccgcgacg cagcatctgc cgtttgcata gaaacaccac ctacggcttt ctataaaaat 420 
35 acctaa 426 
<212> Type : DNA 
<211> Length : 426 

SequenceName : SEQ ID 619 
SequenceDescription^ : 



40 



Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<4 00> PreSequenceString : 

45 atgaaaacaa ccgacaaacg gacaaccgaa acacaccgca aagccccgaa aaccggccgc 60 

atccgcttct cgcctgctta cttagccata tgcctgtcgt tcggcattct tccccaagct 120 

tgggcgggac acacttattt cggcatcaac taccaatact atcgcgactt tgccgaaaat 180 

aaaggcaagt ttgcagtcgg ggcgaaagat attgaggttt acaacaaaaa aggggagttg 240 

gtcggcaaat caatgacaaa agccccgatg attgattttt ctgtggtgtc gcgtaacggc 3 00 

50 gtggcggcat tggtgggcga tcaatatatt gtgagcgtgg cacataacgg cggctataac 360 

aacgttgatt ttggtgcgga aggaagaaat cccgatcagc accgtttttc ttaccaaatt 420 

gtgaaaagaa ataattataa gcctgacaat tcacaccctt acaacggcga ttaccatatg 480 

ccgcgtttgc ataaatttgt cacagatgca gaacctgtcg aaatgacgag tgacatgagg 540 

gggaatacct attccgataa agaaaaatat cccgagcgtg tccgcatcgg ctcaggacac 600 

55 cactattggc gttatgatga tgacaaacac ggcgatttat cctactccgg cgcatggtta 660 

attggcggca atacacatat gcagggttgg ggaaataatg gcgtagttag tttgagcggc 720 

gatgtgcgcc atgccaacga ctatggccct atgccgattg caggtgcggc aggcgacagc 780 

ggttcgccaa tgtttattta tgacaaaaca aacaataaat ggctgctcaa cggagtttta 840 

caaaccggct acccttattc cggcagggaa aacggtttcc agctgatacg caaagattgg 900 

60 ttctacgatg acatttacag aggcgataca cataccgtct tttttgaacc gcgcagtaac 960 

ggacattttt cctttacatc caacaacaac ggtacgggta cggtaacaga aaccaacgaa 1020 

aaggtttcca atccaaagct taaagtacag acagtccgac tgtttgacga atctttgaat 1080 

gaaactgata aagaaccagt ttacgcggca gggggtgtta atcagtaccg tccaaggtta 1140 

aacaacggtg aaaacctttc ttttatcgat tacggcaacg gcaaactcat cttatcaaac 1200 

65 aacatcaacc aaggcgcggg cggtttgtat tttgaaggtg attttacggt ctcgcctgaa 1260 

aacaacgaaa cgtggcaagg cgcgggcgtt catatcagtg aagacagtac cgttacttgg 1320 

aaagtaaacg gcgtggcaaa cgaccgcctg tccaaaatcg gcaaaggcac gctgcacgtt 1380 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



caagccaaag gggaaaacca aggctcgatc agcgtgggcg 
cagcaggcag acgataaagg caaaaaacaa gcctttagtg 
aggggtacgg tgcaactgaa tgccgataat cagttcaacc 
tttcgcggcg gacgtttgga tttaaacggg cattcgcttt 
accgatgaag gggcgatgat tgtcaatcat aatgccacaa 
acagggaatg aaagtattac acaaccgagt ggtaagaata 
aaagaaattg cctacaacgg ttggtttggc gagaaagata 
ctcaaccttg tttaccagcc cgccgcagaa gaccgcaccc 
aatttaaacg gcaacatcac gcaaacaaac ggcaaactgt 
ccgcacgcct acaatcattt aggaagcggg tggtcaaaaa 
gaaatcgtgt gggacaacga ctggatcaac cgcacgttta 
cagggcgggc aggcggtgat ttcccgcaat gttgccaaag 
agcaatcacg cccaagcagt ttttggtgtc gcaccgcatc 
cgttcggact ggacgggtct^gacaaattgt gtcgaaaaaa 
attgcttcat tgactaagac cgacatcagc ggcaatgtca 
ttaaatctca cagggcttgc cacactcaac ggcaatctta 
tatacagtca gccacaacgc cacccaaaac ggcaacctta 
gcaacattta atcaagccac attaaacggc aacacatcgg 
aatctaagca acaacgccgc acaaaacggc agtctgacgc 
aacgtaagcc attccgcact caacggcaat gtctccctag 
tttgaaaaca gccgctttac cggacaactc agcggcagca 
aaagacagcg aatggacgct gccgtcaggc acggaattag 
gccaccatta cactcaattc cgcctatcgc cacgatgctg 
gtgtcagaca cgccgcgccg ccgttcgcgc cgttccctat 
tcggtagaat cccgtttcaa cacgctgacg gtaaacggca 
ttccgcttta tgtcggaact cttcggctac cgaagcgaca 
tccgaaggca cttacacctt ggcggtcaac aataccggca 
caattgacgg tagtggaagg gaaagacaac aaaccgctgt 
ctgcaaaacg aacacgtcga tgccggcgcg tggcgttacc 
gagttccgcc tgcataatcc ggtcaaagaa caagagcttt 
gaagccaaaa aacaggcgga aaaagacaac gcgcaaagcc 
gggcgcgatg ccgccgaaaa gacagaaagc gttgccgaac 
gaaaatgtcg gcattatgca ggcggaggaa gagaaaaaac 
agcgccttgg cgaaacagcg cgaagcggaa acccggccgg 
gcccgccgcg cccgccggga tttgccgcaa ccgcagcccc 
ccgcagcgcg acctgatcag ccgttatgcc aatagcggtt 
ctcaacagcg ttttcgccgt acaggacgaa ttggaccgcg 
aacgccgttt ggacaagcgg catccgggac accaaacact 
gcctaccgcc aacaaaccga cctgcgccaa atcggtatgc 
cgcgtcggca tcctgttttc gcacaaccgg accgaaaaca 
aactcggcac ggcttgccca cggcgccgtt ttcgggcaat 
atcggcatca gcacgggcgc gggttttagc agcggcagtc 
aaaatccgcc gccgcgtgct gcattacggc attcaggcac 
ggattcggca tcgaaccgta catcggcgca acgcgctatt 
cgctacgaaa acgtcaatat cgccaccccc ggtcttgcgt 
attaaggcag attattcatt caaaccggcg caacacattt 
ctgtcctata ccgatgccgc ttcgggcaaa gtccgaacac 
gctcaggatt tcggcaaaac ccgcagtgcg gaatggggcg 
ttcacgctgt ccctccacgc tgccgccgcc aaaggcccgc 
gcgggcatca aattaggcta ccgctggtaa 
<212> Type : DNA 
<211> Length : 4350 

SequenceName : SEQ ID 620 

SequenceDescription : 

Sequence 



acggtacagt 
aaatcggctt 
ccgacaaact 
cgttccaccg 
caacatccac 
tcaatagact 
cgaccaaaac 
tgctgctttc 
ttttcagcgg 
tggaaggtat 
aagcggaaaa 
tggaaggcga 
aaagccatac 
ccattaccga 
gccttgccga 
gtgcaaatgg 
gcctcgtggg 
cttcgggcaa 
tttccgacaa 
ccgataaggc 
aggatacagc 
gcaatttaaa 
caggcgcgca 
tatccgttac 
aattgaacgg 
aattgaagct 
acgaacccgt 
ccgaaaacct 
aactcatccg 
ccgacaaact 
ttgacgcgct 
cggcccggca 
gggtgcaggc 
ctaccaccgc 
aaccgcaacc 
tgagtgaatt 
tgtttgccga 
accgttcgca 
agaaaaacct 
ccttcgacga 
acggcatcgg 
tttcagacgg 
gataccgcgc 
tcgtccaaaa 
tcaaccgcta 
ccatcacgcc 
gcgtcaatac 
taaacgccga 
aactggaagc 



cattttggat 
ggtcagcggc 
ctatttcggc 
tattcaaaat 
cgttaccatt 
taattacagc 
gaacgggcgg 
cggcggaaca 
cagaccgaca 
cccacaagga 
tttccatatt 
ttggcatttg 
aatctgtaca 
cgataaagtg.- 
tcacgctcat 
cgatacacgt 
caatgcccaa 
tgcttcattt 
cgctaaggca 
agtattccat 
attacactta 
ccttgacaac 
aaccggcagt 
accgccaact 
tcaaggaaca 
ggcggaaagt 
aagcctcgat 
taatttcacc 
caaagacggc 
cggcaaggca 
gattgcggcc 
ggcaggcggg 
ggataaagac 
cttcccccgc 
tcaaccccaa 
ttccgccacg 
agaccgccgc 
agatttccgc 
cggcagcggg 
cggcatcggc 
caggttcgac 
catcggaggc 
cggtttcggc 
agcggattac 
ccgcgcgggc 
ttatttgagc 
cgccgtattg 
aatcaaaggt 
gcaacacagc 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
-2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4350 



<213> OrganismName : Neisseria meningitidis 
<400> PreSequenceString : 

atgaacaccc ttcaaaaagg ttttaccctt atcgagctga 
ggcattttgg cggcagtcgc ccttcctgct tatcaagact 
tccgaagcca ttcttttagc cgaaggtcaa aaatcagccg 
cacggcgaat ggcccagcaa caacacttct gccggcgtgg 
ggcaaatatg ttcaaagcgt tgaagtcaaa aacggcgtcg 
agcaacgtaa acaacgaaat caaaggcaaa aaactctccc 
ggttcggtaa aatggttctg cggacagccg gttaagcgca 
gacgacgtca aagccgacac cgccgccaac ggcaagcaga 



serogroup A strain 22491 



tgattgtgat 
acacagcccg 
tcaccgagta 
catcctccac 
ttaccgccac 
tgtgggccaa 
acgacaccgc 
tcgacaccaa 



tgccatcgtc 
cgcacaagtt 
ttacctgaat 
cgacattaaa 
aatggcttca 
gcgtcaagac 
caccaccaac 
gcacctgccg 



60 
120 
180 
240 
300 
360 
420 
480 
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tcaacctgcc gcgacgcagc atctgccgga taa 513 
<212> Type : DNA 
<211> Length : 513 

SequenceName : SEQ ID 621 
5 SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain S2491 

10 <400> PreSequenceString : 

atgcaagcac ggctgctgat acctattctt ttttcagttt ttattttatc cgcctgcggg 60 

acactgacag gtattccatc gcatggcgga ggtaaacgct ttgcggtcga acaagaactt 120 

gtggccgctt ctgccagagc tgccgttaaa gacatggatt tacaggcatt acacggacga 180 

aaagttgcat tgtacattgc aactatgggc gaccaaggtt caggcagttt gacagggggt 740 

15 cgctactcca ttgatgcact gattcgtggc gaatacataa acagccctgc cgtccgtacc 300 

gattacacct atccacgtta cgaaaccacc gctgaaacaa catcaggcgg tttgacaggt 3 60 

ttaaccactt ctttatctac acttaatgcc cctgcactct cgcgcaccca atcagacggt 420 

agcggaagta aaagcagtct gggcttaaat attggcggga tgggggatta tcgaaatgaa 480 

accttgacga ctaacccgcg cgacactgcc tttctttccc acttggtaca gaccgtattt 540 

20 ttcctgcgcg gcatagacgt tgtttctcct gccaatgccg atacggatgt gtttattaac 600 

atcgacgtat tcggaacgat acgcaacaga accgaaatgc acctatacaa tgccgaaaca 660 

ctgaaagccc aaacaaaact ggaatatttc gcagtagaca gaaccaataa aaaattgctc 720 

atcaaaccaa aaaccaatgc gtttgaagct gcctataaag aaaattacgc attgtggatg 780 

ggaccgtata aagtaagcaa aggaattaaa ccgacagaag gattaatggt cgatttctcc 840 

25 gatatccaac catacggcaa tcatatgggt aactctgccc catccgtaga ggctgataac 900 

agtcatgagg ggtatggata cagcgatgaa gcagtgcgac gacatagaca agggcaacct 960 

tga ~ " " 963 
<212> Type : DNA 
<211> Length : 963 

30 SequenceName : SEQ ID 622 

SequenceDescription : 



Sequence 



35 <213> OrganismName : Neisseria meningitidis serogroup A strain 22491 
<400> PreSequenceString : 

atgcgcccaa tcttcctatc tttcgtttta ttccctattt tgataaccgc ctgcagcaca 60 
ccggacaagt ctgcccgatg ggaaaatatc ggcacaatct caaacggcaa tattcataca 120* 
tatatcaata aagacagcgt gagaaaaaac ggaaatctga tgattttcca agataaaaaa 18 0 

40 gttgttacca atctaaaaca agaacgtttt gccaacaccc ccgcatacaa gactgccatt 240 
gccgagtggg aaatccactg caacaacaaa acataccgct taagttcgct acaattgttt 300 
gatacaaaaa acacggaaat ttccacacaa aactacacag cctcttccct ccgcccgatg 360 
agcatcctgt ccgggacatt aaccgaaaaa caatatgaaa ccgtatgcgg aaaaaaactc 42 0 

tga " ~ 423 

45 <212> Type DNA 

<211> Length : 423 

SequenceName : SEQ ID 623 
SequenceDescription : 

50 Sequence 

<213> OrganismName : Neisseria meningitidis serogroup A strain 22491 
<400> PreSequenceString : 

atgaacaaac ttttcattac cgccctgtcc gccctcgcct tgtccgcctg cgccggcact 
55 tgggagggcg cgaaacaaga caccgcccgc aaccttgaca aaacacaggc cgccgccgaa 
cgcgccgccg aacaaacagg caacgccgtc gaaaaaggct gggacaaaac caaagaagcc 
gtcaaaaaag gcggcaatgc cgtcggacgc ggcatttccc atctcggcgg aaaaatcgaa 
aacgccaccg aataa 
<212> Type : DNA 
60 <211> Length : 255 

SequenceName : SEQ ID 624 
SequenceDescription : 

Sequence 
65 

<213> OrganismName : Neisseria meningitidis serogroup A strain 22491 
<400> PreSequenceString : 



60 
120 
180 
240 
255 
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atgaaactcc tcttcatccc cctagtcctc ttcgtcgccg tcgaacattt ctacatcgcc 60 

tggcttgaaa tgacgcagat tcccagcgaa aaagcggcgg aaacgttcaa gctgccttat 12 O 

gaatttatgg aacaaaatcg cgtgcagacc ctgttcggca accaagggct gtataacggc 18 O 

tttctcggca tcgggctggt gtggtcgcgg tttgccgctc cggataacgc ggtgtacggc 24 O 

5 gcaacggtac tgtttctcgg cttcgtcctg attgccgccg cgtggggcgc gttctcttcc 3 0O 

ggcaacaaag gcatactcgt caaacaaggt ttgcccgcat ttttggcagc ggcggcggtg 3 60 

ttggcggtat ga " "~ " J ' " 3?2 
<212> Type : DNA 
<211> Length : 372 
10 SequenceName : SEQ ID 625 
SequenceDescription : 



Sequence 

15 <213> OrganisraName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atggcttcaa gcaacgtaaa caacgaaatc aaagacaaaa aactctccct gtgggccaag 60 
cgtcaagacg gttcggtaaa atggttctgc ggacagccgg ttaagcgcga cgccgccacc 120 
gacgccgacg tcaccgccga cagcggcaac gaaatcgaca ccaagcacct gccgtcaacc ISO 
tgccgcgacg cagcatctgc cgtttgcaca aaaacacccg agtattaccc gaatcacggc 240 
gaatggccga aaaacttcgt cattcccgcg caggcgggaa tccaggtctg tcggcacgga 3 0O 

aacttatcgg gtaaaaaggt ttctccggtc ctgagttcta gattcccact ttcgtgggaa 360 
tga . J ~ 363 

<212> Type : DNA 
25 <211> Length : 363 

SequenceName : SEQ ID 626 
SequenceDescription : 



20 



Sequence 
30 

<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atccttttag ccgaaggtca aaaatcagcc gtcaccgagt attacctgaa tcacggcgaa ■ 60 

tggcccagca acaacacttc tgccggcgtg gcaacctcca ccgacattaa aggcaaatat 120 

35 gttcaaagcg ttgaagtcaa aaacggcgtc gttaccgcca caatggcttc aagcaacgta 180 
aacaacgaaa tcaaaggcaa aaaactctcc ctgtgggcca agcgtcaaga cggttcggta 240 
aaatggttct gcggacagcc ggttaagcgc aacgacaccg ccaccaccaa cgacgacgtc 300 
aaagccgaca ccgccgccaa cggcaagcag atcgacacca agcacctgcc gtcaacagca -3 60 
tcgacaagaa aatcgacacc aaactag " 387 

40 <212> Type : DNA 

<211> Length : 387 

SequenceName : SEQ ID 627 
SequenceDescription : 

45 Sequence 



<213> OrganismName : Neisseria m< 
<400> PreSequenceString : 
atgcccatcc cctttaaacc cgtattggct 

50 tttgcggcag accccgcgcc gcagtccgcc 
acgcacaaaa cccaaaaact cggcgaagaa 
ttggtcaacg acgaacacga cctggtgcgc 
ggcagggcgg gttctaacgg ctttaccata 
aacgttgacg ggctggcgca ggcggaaagc 

55 ggcgcgtacg gcaacttcaa cgccaaccgc 
gtaaccatca ccaaaggcgc ggactcgctc 
gtcaattacc aaaccaaatc cgcaagcgat 
gggataaagg gcggcagcgt cggcaaaaac 
ggcaggctct ttggtttgga tgccttattg 

60 aaaaaccgct cgaccgaggg cgatatcgaa 
accgatacag gcggacccag caagtacctg 
caacccgacc cgcaagaatg ggtaaacaaa 
aacgatcaaa accgtatcgg ctggattttt 
gagctgtcta atttgtggac gggtacgacc 

65 cgccaagacg tgagctaccg ccgccgctcc 
ggcccgtggg acagccttaa gctgcgctac 
acttgggaca tcccgaaaaa ttacgataaa 



mingitidis serogroup A strain Z2491 

gccgccgcca tcgcccaagc gtttcccgcc 60 

caaacgctga acgaaatcac cgttaccggc 120 

aaaatccgcc gcaaaacttt agacaagctc 180 

tacgaccccg gcatttccgt cgtcgaaggc 240 

cgcggcgtgg acaaagaccg cgtcgccatc 3 00 

cgctcttccg aagccttcca agaattgttc 3 60 

aacacttccg agccggaaaa cttttccgaa 420 

aaatccggca gcggcgcatt gggcggcgca 480 

tatgtttccg aagacaagcc ctaccatttg 540 

agccaaaaat tcagcagcat caccgccgcc 600 

gtttataccc gccgcttcgg caaagaaacc 660 

attaaaaacg acggatatgt ctataacccg 720 

acctatgtag ccacaggggt tgcgcgctcc 780 

agcaccctgt tcaagctggg ctacaacttc 840 

gaagactcgc gcaccgaccg ttttaccaac 900 

acgtctgccg caacgggcga ctaccgccac 960 

ggtgtcgaat acaaaaacga attggaacac 1020 

gacaagcagc gcatcgatat gaacacttgg 1080 

agaggcatca acggcgaggt ttaccattcg 1140 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



ttccggcata tccgccaaaa caccgcgcaa 
ttttccaaag ccgtttgggc ggcgcaatac 
aactcggatt acagctattt cgcaaaactg 
gccaaaatca caatgctgat cgaaaaccgg 
gcgtttcact tgggcggcaa cgaccgcttc 
aacagcagca gcgcgaaaga cgatccgaaa 
catttgggtt cggaacgcgc gcacgcgggc 
tttaccaagc atctgcactt gttggcaaaa 
gacgaaactt ggctactgtt cccacacccc 
aaagccgaaa aagccaaaaa ctgggaattg 
ttcaagctct cgggcttcaa aaccaaatac 
gtttcgtcag acgataaaaa caaccccaga 
gtcagctcgc ccgtttggca aaaccaaaac 
tttaacggqa cgtggaacct cgacagtatc 
aacgtcagct acatcaaggg caaggcaacg 
gcgctttcgc cgtggacggc ggtttacagc 
ggcatcaacg cctacgccac gcgcaccgcc 
aacgacgact tgaacaaccc gtggccttat 
gacctttccg cctacctcaa catcggcaaa 
attaccaaca agcagtacta cacttgggaa 
gtcaaccgcg ttgacaacaa aacccacgcc 
agctacaatt tcaccatcga agcgaagttc 
<212> Type : DNA 
<211> Length. : 2433 

SequenceName : SEQ ID 628 
SequenceDe script ion : 

Sequence 



tggactgccg 
ggcttgggcg 
tacgacccca 
tcgaaatata 
cgcctgaatg 
tacaccaccg 
ttcagctacg 
tacagcaccg 
gatttctacc 
ggtctggcgg 
cgcgacttta 
tacgccccgc 
cgctccgccg 
ggtttgccca 
caaaacaacg 
ctgggctatg 
gccaaaaagc 
gccaaacaca 
caggttacgc 
tctttacgca 
ggcatccaac 
taa 



attttgaaaa 
gcggcaaagg 
aaatcctcgc 
aatttgccta 
cgggcatacg 
ccatccgggg 
gcacggggtt 
gcttccgcgc 
tgaaagccaa 
gcagcggcaa 
tcgaattgac 
tttcagacgg 
cctgggtgaa 
aagggctgca 
gcaaagaaac 
acgcgccttc 
cgtccgacac 
gcaaggccta 
tccgcgccgc 
gcatccgcga 
gctttacctc 



acaactcgac 

ggacaatgcc 

ttccaaccaa 

ttggaacaat 

ctacgacaaa 

gcagattccc 

cgactggcgg 

accgacttcg 

cccaaacctg 

agcgggcaac 

gtatatgggc 

tacggcattg 

aggcatagag 

caccggcctc. 

gcccatcaac 

caaacgctgg 

cgtccacagc 

tacgctgttc 

cgcgtacaac 

gttcggcacg 

gccgggcagg 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
162 0 
1680 
1740 
1800 
1860 
1920 
198G 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2433 



serogroup A strain Z2491 



<213> OrganismName : Neisseria meningitidis 
<400> PreSequenceString : 

atgaaaaaat ccctgattgc cctgactttg gcagcccttc ctgttgcagc aatggctgac 60 

gttaccctgt acggcaccat caaaaccggc gtagaaactt cccgctctgt agaacacaat 120 

ggaggtcagg tggttagcgt tgaaaccggt accggcatcg ttgatttggg ttcgaaaatc 180 

ggcttcaaag gccaagaaga cctcggtaac ggcctgaaag ccatttggca ggttgagcaa 240 

aaggcatcta tcgccggcac tgactccggt tggggcaacc gccaatcctt catcggcttg 300 

aaaggcggct tcggtaaatt gcgcgtcggc cgtttgaaca gcgtcctgaa agacaccggc 360 

gacatcaatc cttgggatag caaaagcgac tatttgggtg taaacaaaat tgccgaaccc 420 

gaagcacgcc tcatttccgt acgctacgat tctcccgaat ttgccggcct cagcggcagc 480 

gtacaatacg cgcttaacga caatgtaggc agacataaca gcgaatctta ccacgccggc 540 

ttcaactaca aaaacggcgg cttcttcgtg caatatggcg gtgcctataa aagacatcag 60 0 

gatgtggatg acgtgaagat tgagaaatac cagattcacc gtttggtcag cggttacgac 660 

aatgatgccc tatacgcttc cgtagccgta cagcaacaag acgcgaaact ggttgaagac 720 

aattcgcaca actctcaaac cgaagttgcc gctaccttgg cataccgctt cggcaacgta 780 
acgccccgcg tttcttacgc ccacggcttc aaaggctcgg ttgatgatgc aaaacgcgac . 840 

aatacttacg accaagtggt tgtcggtgcg gaatacgact tctccaaacg cacttctgcc 900 

ttggtttctg ccggttggtt gcaagaaggc aaaggcgaaa acaaattcgt agcgactgcc 960 

ggcggtgtcg gtctgcgcca caaattctaa ~ 990 
<212> Type : DNA 
<211> Length : 990 

SequenceName : SEQ ID 629 

SequenceDescription : 



55 



60 



65 



Sequence 

<213> OrganismName : Neisseria meningitidis 
<4 00> PreSequenceString : 

atgaaaaccc tgctcctcct catccccctc gtcctcacag 
atacccgccc acggcggcgg caaacgcttt gccgtcgaac 
tcccgcgccg ccgtcaaaga aatggacttg tccgccctga 
tacgtctccg ttatgggcga ccaaggttcg ggcaacataa 
gacgcactga tacgcggcgg ctaccacaac aaccccgaaa 
cccgcctacg acactaccgc caccaccaaa tccgacgcgc 
acatcgcttt tgaacgcccc cgccgccgcc ctgacgaaaa 
cgctccgccg gactgtccgt caacggcacg ggcgactacc 
aacccccgcg acgtttcctt cctgaccaac ctcatccaaa 
atcgaagtcg taccgcccga atacgccgac accgacgtat 



serogroup A strain Z2491 



cctgcggcac 
aagaactcgt 
aaggacgcaa 
gcggcggacg 
gtgccaccca 
tctccagcgt 
acagcggacg 
gcaacgaaac 
ccgtcttcta 
tcgtaaccgt 



actgaccggc 
cgccgcatcg 
agccgccctt 
ctactctatc 
atacagctac 
aaccacttcc 
caaaggcgaa 
cctgctcgcc 
cctgcgcggc 
cgacgtattc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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ggcaccgtcc gcagccgcac cgaactgcac ctctacaacg ccgaaaccct taaagcccaa 660 
accaagctcg aatatttcgc cgttgaccgc gacagccgga aactgctgat tgcccctaaa 720 
accgccgcct acgaatccca ataccaagaa caatacgccc tctggatggg accttacagc 780 
gtcggcaaaa ccgtcaaagc ctcagaccgc ctgatggtcg atttctccga catcaccccc 840 
tacggcgaca caaccgccca aaaccgtccc gacttcaaac aaaacaacgg taaaaaaccc 9 00 

gatgtcggca acgaagtcat ccgccgccgc aaaggaggat aa 942 
<212> Type : DNA 
<21l> Length : 942 

SeguenceName : SEQ ID 63 0 

SequenceDescription : 



Sequence 



<-213> OrganismName • Neisseria meningitidis ccro^rovp £ "t^-siri 
<4 00> PreSequenceString : 

atgaataaaa ccttgtctat tttgccggtg gcaatcttac tcggcggctg cgccgccggg 60 
ggcggtaaca cattcggcag cttagacggc ggcacaggta tgggcggcag catcgtcaaa 120 
atggcggtag aaagccaatg ccgtgcggaa ttgaacaaac gcagcgaatg gcgtttgacc 180 
gcgctggcga tgagtgccga aaaacaggcg gaatgggaaa acaagatttg cgcttgcgtc 240 
gcccaagaag cacccaacca gctgaccggc aacgatgtga tgcagatgct ggatccgtcc 300 
acgcgcaatc aggcacttgc cgccctgacc gccaaaacgg tttccgcctg cttcaaacac 360 
ctgtaccgct aa " 372 

<212> Type : DNA 
<211> Length : 372 

SequenceName : SEQ ID 631 

SequenceDescription : 



Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaatccac ttattcatca agcaaaggaa tcatctatgc aaacccgcat cctctccgcc 60 

gtactgctgg ctttttcaac cgctgccttt gccgggggcg cattcacgct gcaattcgac 12 0 

aacccgtccg aagacggcgg cttcacgcaa aaccagattt tgagcgcgcc ttacggcttt 18 0 

ggctgttcgg gcggcaatgc ttcgcccgcg ctgtcgtgga aaaatccgcc cgccgggaca 240 

aaaagtttcg tcctgaccgt ttacgataaa gacgcgccga ccggactggg ctggatgcac 3 00 

tgggtggtcg ccgacattcc cgccgatgtc cgccgccgca atgcgacctc gctgcaatta 360 

agccgctgcg ccagcatcgc cgacgaccag tccgcagcca tatcggcagt aatcagtttg 420 

cagatttgcc gcatcaggtt gacgccttcg tacacggcaa aaccgatgcc gtcatgctgc 48 0 

aaccacgcca acacgccgca aagcgcggcc tccgccgcat tgtgcggcac ttcttcatcc 54 0 

gtcagcaccg ccgccgcata a 561 
<212> Type : DNA * 
<211> Length : 561 

SequenceName : SEQ ID 632 

SequenceDescription : 



Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaataaaa ctttaaaaag gcgggttttc cgccataccg cgctttatgc cgccatcttg 6 0 

atgttttccc ataccggcgg gggggggggg gcgatggcgc aaacccgtca atacgctatt 120 

atcatgaacg agagaaacca gcccgaggta cagtggaatg ggtcatattc aataaaggac 180 

aaagacagga agcgcgaata tactcatcat aatcaccaac aaggaggaag ctctgtctca 240 

ttcaacaata gcgatgagct tgtttctcga caaagcggta ctgccgtttt tggcacagcc 3 00 

acctacctgc cgccctacgg caaggttt cc ggttttgatg ccgccgctct gaaagagcgc 360 

aacaatgccg tcgattggat tcataccacc cacccagggt tgataggcta cagctacgac 42 0 

ggtgtcgtat gcagaagcgc cacagactgt cccaaacttg tctataaaac ccgattttcc 480 

ttcgataatc ccgacttggc aaaaacagga ggcgggttgg ataagcacac agagccaagc 540 

cgcgacaatt cgcccattta caaattgaag gatcatccat ggttgggcgt gtctttcaat 600 

ttgggtgccg agggtatcgc caaaaatggt aagacaatca acaaattggt atcttctttt 660 

aatgaaaaga atagtaataa caacctcgtc tataccacgg aaggccgcga tatttccttg 720 

ggcaactggc agcgcgaaac gaccgccatg gcctattatc tgaacgccaa gctgcacctg 780 

ctggataaaa aacagattca aaatatcacc gacaaaacag tgcagttggg tgtcttgaag 840 

ccgagcatcg atgtgcggac aagaaatacg gggactgccg gcattctatc ttattgggct 900 

aagtgggaca ttaaagatac cgggcagatt ccagtcaagc tcagcttgac gcaagtcaaa 960 

gcaggccgct gcgtcaacaa agataacccc aataagaata ccaaaacctc ttcccccgca 102 0 
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ctgactgccc ccgcgctgtg gttcggagct gggcaagatg gtaaggcgga gatgtattcc 1080 

gcttcggttt ccacctaccc cgacagttcg agcagccgca tcttccttca aaatctgaaa 1140 

agaaaaaccg acaccagcag acccggccgc tattccctcg caaccttgaa taagtcggat 1200 

attgaaagtc gagagccgag tttcacaagt cggcaaaccg tcatccgatt ggatggcggc 1260 

gtacagcaga tcaaactgga tagaaacaat actgaggtca ctggttttaa tggaaatgac 1320 

ggcaaaaacg acactttcgg cattgttagt gaagggagct tcatgcctga tgccagcgag 13 80 

tggaaaaaag tattgctgcc ttggacggtt cgtgctttca attatgacgg tcgatttaac 1440 

acagtcaaca aagaagaaaa caacggcaag ccaaaataca gtcaaaaata ccgcagccgc 1500 

aacaacggca agcacgagcg caatttgggc gacatcgtca acagccccat cgtggcggtc 1560 

ggcgagtatt tggctacttc cgccaacgac gggatggtgc atatcttcaa acaaagcggc 1620 

ggggacaagc gcagctacaa tctgaagctc agctacatcc ccggcacgat gccgcgcaag 168 0 

gatattgaaa gcaaagactc cacccttgcc aaagagctgc gcgccttfcgc cgaaaaaggc 1740 

tatgtgggcg accgctacgg cgtggacggc ggctttgtct tgcgccgcat tacagatgac 1800 

cap.gacaagc aaaaacactt ccttatgttc ggcgcaatgg gccttggcgg cagaggcgca 1860 

tacgccttgg atttgaccaa agccgacgac aatgacccga caaaagcctc tttgtttgat 1920 

gtaaaagata acggcaataa tggcaataac ggcaataatc gcgtggaatt aggctacacc 1980 

gtcggcacgc cgcaaatcgg caaaacccac aacggcaaat acgccgcctt cctcgcctcc 2040 

ggttatgcga ctaaacagat tgacagcggc gagaataaaa ccgcgctgta tgtgtatgat 2100 

ttggaaagca acaacggtac gctgattaga aaaatcgaag taaccgacgg caagggcggg 2160 

ctttcgtccc ccacgctggt ggataaagat ttggacggca cggtcgatat cgcctatgcc 2220 

ggcgatcgcg gcggcaagat gtaccgcttt gatttaagcg gcaacaaccc gaacagttgg 2280 

actgtacgca ctattttcca aggcacgaag ccgattactt ccgcgcccgc catttcccaa 2340 

ctgaaagaca aacgcgtggt tatcttcggt acgggcagtg atttgagtga ggatgatgta 2400 

ctcagtacgg atgaacaaca tatttacggt atttttgaca atgacacaaa cacgggtacg 2460 

gcgcaagagg ggctgggcaa agggctgctc gagcaaaagc ttagtgagga aaataaaacc 2520 

ttattcctga ccgattataa gcgatccgac ggctcgggcg acaagggctg ggtagtgaaa 258 0 

ttgaaggacg gacagcgcgt taccgtcaaa ccgaccgtgg tattgcgtac cgcctttgta 2640 

accatccata aatatacggg taatgacaaa tgcggcgcgg aaaccgccat tttgggcatc 2700 

aataccgccg acggcggcaa gctgaccaag aaaagcgcgc gcccgattgt gccggcagcc 2760 

aattcgaagg tcgcgcaata ttccggcgat aagaaaactt ccagcggcaa atccatccct 2820 

ataggttgta tggaaaaaga cgggggaacc gtctgcccga acggatatgt ttacgacaaa 2880 

ccggttaatg tgcgctacct ggacgaaaag aaaacagacg gattttcaac aacggcagac 2940 

ggcgatgcgg gcggcagcgg aacattcaaa gagggtaaaa aacccgcccg caataaccgg 3 000 

tgcttctccg gaaaaggggt gcgcaccctg ctgatgaacg atttggacag cttggatatt 3060 

accggcccga tgtgcggtat gaaacgaatc agctggcgtg aagtcttcta ctga 3114 

<212> Type : DNA 
<211> Length : 3114 

SequenceName : SEQ ID 633 

SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<400> PreSequenceString : 

atgaaacacc ccaaactcac ccttatcgcc gcattgctga ccactgccgc aactgccgcc 60 

cccctgccgg ttgtaaccag cttcagcatt ttaggcgacg tagccaaaca aatcggcgga 12 0 

gagcgcgtat ccatacaaag tttggtcgga gccaaccaag atacgcacgc ctatcatatg 18 0 

accagcggcg acattaaaaa aatccgcagt gcaaaactcg tcctgattaa cggcttagga 240 

cttgaagctg ccgacatcca acgtgccgtc aaacagagca aagtatccta tgccgaagcg 300 

accaaaggca tccaacccct caaagccgaa gaagaaggcg gacaccatca cgaccacgat 360 

catgaccacg accatgacca cgaaggacac caccacgacc acggcgaata tgacccccac 42 0 

gtctggaacg accccgtcct tatgtccgcc tatgcccaaa acgtcgccga agccctgata 48 0 

aaggccgacc ccgaaggcaa agtttattat caacaacgct tgggcaacta ccaaatgcag 540 

ctcaaaaaac tgcacagtga cgcacaagcc gcatttaatg ccgtccctgc cgccaaacgc 600 

aaagtcctga ccgggcacga tgccttttcc tatatgggca aacgttacca tatcgaattc 660 

atcgccccac aaggtgtgag cagcgaagcc gagccttcag ccaaacaagt cgccgccatc 720 

atccgacaaa tcaaacgcga aggcatcaaa gccgtattta ccgaaaatat caaagacacc 780 

cgcatggttg accgcatcgc caaagaaacc ggtgtcaacg tcagcggcaa actgtattcc 840 

gacgcactcg gcaacgcacc cgcagacacc tacatcggca tgtaccgcca caacatcaaa 90 0 

gccttaacca acgcgatgaa gcaataa " 927 
<212> Type : DNA 
<211> Length : 927 

SequenceName : SEQ ID 634 

SequenceDescription : 



Sequence 
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<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atgaaaaaaa gaattttatc agcagttctt gtaagtggtg ttaccctcgg agcagctaca 60 

actgtaggag cggaggattt gagtactaag attgctaagc aggattctat tatctcaaat 120 

ctgactacag agcaaaaagc tgcacagaat caagtttcag cgttacaggc tcaagtaagt 180 

tcactacaat ctgaacaaga taaactgacc gcaagaaata cagaacttga ggcactttca 240 

aagcgatttg agcaagaaat taaggctcta acaagtcaaa ttgttgctcg taatgaaaaa 3 00 

ttaaaaaatc aagctcgtag tgcttataaa aacaatgaaa cttctggtta cattaatgca 3 60 

cttttgaatt ctaaatcaat ttctgatgtt gtaaaccgtt tagtagcaat taatagagct 42 0 

gtctctgcta acgctaaatt gttagaacaa caaaaagctg ataaagtttc ccttgaagaa 480 

aagcaagctg ctaaccaaac agctattaat accattgccg ctaatatggc aatggctgaa 540 

gaaaaccaaa atacattacg tactcaacaa gctaatttgg aagctgcaac tgcaaattta 600 

gctctccaat tagcatctgc tactgaagat aaagctaatt tggtagctca aaaagaagct ,660 

gcagaaaaag ctgctgctga agccttagca caagaacagg ctgctaaagt taaggcacaa 720 

gaacaggctg cacaacaagc agcatctgtt gaagcagcaa aatctgctat tactccagca 780 

ccacaagcta ctccggcagc gcaaagtagt aatgctattg aaccagctgc actcacggct 84 0 

ccggcagctc cttctgcaag accacaaaca tcatatgatt cttctaatac ttatccagtt 900 

ggacaatgca catggggagc taaatcttta gctccttggg caggaaataa ttggggaaat 960 

ggtggtcaat gggcttatag tgctcaagca gctggttatc gtactggttc aacgccgatg 1020 

gtaggtgcga ttgccgtttg gaacgatggt ggttatggac atgtcgccgt tgtagttgag 1080 

gttcaaagtg cctcaagtat tcgtgtgatg gagtctaact acagtggtag acagtacatt 1140 

gctgatcacc gtggttggtt taatccaaca ggtgttacat ttatttatcc acactaa 1197 

<212> Type : DNA 
<211> Length : 1197 

SequenceName : SEQ ID 635 

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<40 0> PreSequenceString. : 

gtgattacaa ttaaaaatcc aaaaatcctt aagtggctaa agtatgtatt aagtgcaatt 60 

cttagcctta ttatccttgt tattattatt ggtggtcttt tgtttacctt ctacattagc 120 
agtgctccga aactgtcaga agcccagtta aaatcaacaa actctagctt ggtttatgac * 180 

ggtaataaca atctgattgc tgatttgggt tctgaaaagc gtgaaaatgt aacagctgat 240 

agtatcccta ttaatctagt taatgctatt acctcaattg aagataaacg tttctttaac 300 

catcgtggag tagatcttta tcgtattttt ggtgctgcct ttcataatct aacgagtcag 3 60 

accactcaag gggggtcaac gcttgatcag caactcatta aactagccta tttttctact 420 

aatgaatctg atcaaacctt aaaacgtaag gctcaagaag tttggcttgc tcttcaaatg 480 

gagcgaaaat atactaaaca agaaatcctg actttttaca tcaacaaagt atatatgggt 540 

aatggcaact atggtatgct gacagccgct aagtcttatt atggcaagga tcttaaggat 600 

ttatcttatg cccagctagc cctattggct ggaatccctc aagctcctag tcaatatgat 660 

ccttaccttc atcctgaagc tgctcaaaat cgccgtaacg tcgtgttgca acagatgtac 720 

atggaaaaac atctgacgaa agcagaatat gaaactgcca tcgcaactcc cgtcgctgaa 780 

ggtctacaat cactccaaca gcgctcaact tatccaaaat atatggataa ttatctaaaa 840 

caagttattg aagaagtcaa aaaggaaacg aataaagata tttttaccgc tggtttaaaa 900 

gtttatacca atattatccc cgatgcgcag cagactcttt ataatattta tcattctggt 960 

gattatgttt actatccaga ccaagatttc caagttgctt caacgattgt tgatgtgaca 1020 

aatggtcatg ttattgctca gcttggcgga cgtaatcagg atgaaaatgt ttcatttggg 1080 

actaaccaag ctgttttaac tgatcgtgac tggggttcta ccatgaagcc aatcacagcc 1140 

tatgctcctg ctattgaatc tggtgtttat acttctactg ctcagtcgac taatgactca 1200 

gtctattatt ggcctggaac cactacccaa ttgtttaact gggaccttag atataacgga 1260 

tggatgacaa tccaagctgc tattatgcta tcgcgaaatg tcccagcagt ccgagcactg 132 0 

gaagccgcag gacttgacta tgctcgatct ttcttaagca gtttaggtat taactatccc 13 80 

gaaatgcact actcaaacgc tatctcaagt aataacagta gctcagataa aaaatatggt 1440 

gcaagtagtg aaaaaatggc cgctgcatac gctgcttttg caaatggtgg tatttatcat 1500 

aaaccaaggt atgtcaataa agtggaattt agtgatggta caagtaaaac atttgatgaa 1560 

aaaggaaaac gtgccatgaa agaaaccacg gcctatatga tgacagatat gttgaaaact 162 0 

gttctcactt atggtacagg tactgctgct gccattcctg gtgttgcgca agctggtaaa 1680 

acagggactt ctaactacac tgatgaggaa ctagctaaaa ttggtgaaaa atacggcctt 1740 

tatccagatt atgttggtac attagcgcca gacgaaaact ttgttggctt tactaagcgc 1800 

tacgccatgg ctgtttggac aggttacaaa aaccgcttga ccccagtata cggatcaagt 1860 

ctagagattg catctgacgt ttatcgtagc atgatgactt acttaacaaa tggttacagt 1920 

gaagactgga ccatgccaaa tggtctttat cgcagtggtg gattcctcta cttaagcgga 1980 

acctatgcga gcaacaccga ctatactaat tcggtttaca acaatcttta cagcaataac 2040 
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acaacaacag cttctagcca aacgacttca gatgatacta gtagtagcaa tgatacaagt 210 0 

aattcaacca atacagacaa caatggcagc catccatcta ccgatgataa aaagacaact 2160 
cattaa 2166 
<212> Type : DMA 
5 <211> Length : 2166 

SequenceName : SEQ ID 636 

SeguenceDe script ion : 

Sequence 
10 

<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<4 00> PreSequenceString : 

atgattatta ctaaaaagag cttatttgtg acaagtgtcg ctttgtcgtt agcacctttg 60 

gtgacagcgc aggcacaaga gtggacacca- cgatcggtta cagaaatcaa gtctgaactc ...A 20 . 

15 gtcctagttg' ataatgtttt tacttatact gtaaaatacg gtgacacttt aagcacaatt 180 

gctgaagcaa tgggaattga tgtgcatgtc ttaggagata ttaatcatat tgctaatatt 240 

gacttaattt ttccagacac gatcctaaca gcaaactaca accaacacgg tcaggcaacg 3 00 

actttgacgg ttcaagcacc tgcttctagt ccagctagcg ttagtcatgt acctagcagt 360 

gagccattac cccaagcatc tgccacctct caatcgactg ttcctatggc accatctgcg 42 0 

20 acaccatctg atgttccaac gacaccatta gcatctgcaa agccagatag ttttgtgaca 480 

gcgtcatctg agctcacatc gtcaacgaat gatgtttcga ctgagttgtc tagcgaatca 540 

caaaagcagc cagaagtatc acaagaagca gttccaactc ctaaagcagc tgaaacgact 600 

gaagtcgaac ctaagacaga catctcagaa gatccaactt cagctaatag gcctgttcct 660 

aacgagagtg cttcagaaga agcttcttct gcggccccag cacaagctcc agcagaaaaa 720 

25 gaagaaacct ctcagatgtt aactgcgcca gcggcacaaa aagctgtagc tgacaccaca 780 

agtgttgcaa cctcaaacgg cctttcttac gctccaaacc atgcctacaa tccaatgaat 840 

gcagggcttc aaccacaaac agcagcgttc aaagaagaag tggcttctgc ctttggtatt 900 

acgtcgttta gtggttaccg tccaggagat ccaggagatc atggtaaagg attagccatt 960 

gactttatgg taccggttag ctctacgctt ggtgatcaag ttgctcaata tgccattgac 1020 

30 catatggcag agcgtggtat ttcatacgtt atttggaaac agcgattcta tgcgccattt 10 80 

gcaagtattt acggaccagc ctatacatgg aaccccatgc cagatcgcgg cagtattaca 1140 
gaaaaccatt atgatcatgt tcatgtctcc tttaatgctt aa 1182 
<2!2> Type : DNA 
<211> Length : 1182 

35 SequenceName : SEQ ID 637 

SequenceDescription : 

Sequence 



40 <213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atgaaaaaga aaattctttt aatgatgagt ttaatcagtg tcttttttgc ttggcaactt 60 

actcaggcaa aacaagtctt agcagagggt aaagtgaagg tggtgacaac tttctaccct 120 

gtttatgaat ttacaaaagg ggttattggt aatgatggcg atgtttccat gcttatgaaa 180 

45 gcaggaacgg aacctcatga ttttgagcct tctacaaaag acattaaaaa aatccaagat 240 

gcagatgcat ttgtttatat ggatgacaat atggaaactt gggtttctga tgtgaaaaaa 300 

tcattgacat ctaaaaaagt gaccatcgtc aagggaactg gtaacatgct cttggtagca 360 

ggagctggac atgaccatca ccatgaggat gctgacaaaa agcatgagca taataaacat 420 

agcgaagaag gacacaacca tgcttttgac ccacacgtgt ggttgtcacc ataccgtagc 480 

50 attacggtcg ttgaaaatat tcgcgacagt ctttcaaaag cttacccaga aaaagcagag 540 

aacttcaaag ccaatgccgc tacttatatt gaaaaattaa aagagcttga caaagactat 600 

acggcagcac tttcagatgc taagcaaaag agctttgtga cacaacacgc agcttttggt 660 

tatatggcac ttgactatgg cttgaaccaa atttctatta atggtgtcac accagatgca 720 

gaaccatcag caaaacgtat tgctactttg tcaaaatacg ttaaaaaata tggcatcaaa 780 

55 tacatttatt ttgaggaaaa tgcgtcaagt aaagtcgcaa aaaccctagc taaagaagca 840 

ggagttaaag cggctgtgct tagtccgctt gaaggtttga ctaaaaaaga gatgaaagct 900 

ggccaagatt actttacggt catgcgtaaa aaccttgaaa ccttacgctt aaccactgat 960 

gtggctggta aagaaattct tccagaaaaa gacacgacta agacagttta caatggttat 1020 

ttcaaagaca aagaagtcaa agatcgtcaa ttatctgact ggtcaggtag ctggcaatct 1080 

60 gtttacccct atttacaaga tggtacttta gaccaagttt gggactataa ggctaaaaaa 1140 

tctaaaggta aaatgacagc agccgagtac aaagattact acactactgg ttataaaact 12 0 0 

gacgtggaac aaatcaaaat caatggtaag aaaaagacca tgacctttgt tcgtaatggt 1260 

gaaaagaaaa ccttcactta cacatacgcc ggcaaagaaa tcttgaccta tccaaaagga 1320 

aatcgcggag ttcgtttcat gtttgaagct aaagaaccaa atgctggcga attcaaatac 13 80 

65 gttcaattca gtgaccatgc cattgctcct gaaaaagcag agcatttcca cctgtactgg 1440 

ggtggtgaca gccaagaaaa attacataaa gagttagaac attggccaac ttactacggt 150 0 

tcagacttat ctggtcgtga aattgcccaa gaaatcaatg ctcattaa 1548 
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<212> Type : DMA. 

<211> Length : 1548 

SequenceName : SEQ ID 638 
SequenceDescription : 

5 

Sequence 



<213> Organ! smName : Streptococcus pyogenes strain MGAS8232 

<400> PreSequenceString : 
10 atgaaaaaat ttcatcgttt tttggtctca 

gtacctacta tgccatctac acttatttcg 

ttaggcgata actatccgag taagtggaaa 

tatatccgcc aatgcacttc ttttgcagct 

ttacctaaag gctacggtaa tgcctgcacg 
15 cctgtgaata agacaccaag cataggggct 

tcaaatgctg cttacgatca tgtagcatgg 

atcgaagagt ataattacaa cgctggacaa 

ccaaaatctc aggtaagtgg ttatatccat 

tcctacccaa gacaactaaa acacatttct 
20 cactttacaa ccagattacc agtcaaagga 

tactatgaag caggtcaatc tgtttattac 

tggcttagct acctcagttt ttctggaaac 

cagtctgtgg ttcaaaatga caatacaaaa 

ttccctggcg tttttcgtgt agatcagctt 
25 gccggaggag acccaactcc actaaactgg 

aaccaaggaa aagttttagg aaatcaaatt 

ggtagttata aagtattaaa aattgatcaa 

tctcgtggaa catgggtaaa tgctgataaa 

<212> Type : DNA 
30 <211> Length : 1125 

SequenceName : SEQ ID 639 
SequenceDescription : 

Sequence 

35 

<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atgttaaaat ttacttcaaa tattttagct actagtgtag ctgaaacaac tcaagttgct 60 
cctggaggct gctgttgctg ctgtactact tgttgcttct caattgctac tggaagtggt 120 
40 aattctcaag gtggtagcgg aagttatacg ccaggtaaat aa 162 
<212> Type : DNA 
<211> Length : 162 

SequenceName : SEQ ID 640 

SequenceDescription : 

45 

Sequence 



ggagtaatcc ttttaggttt taatggtcta 60 

caacaggaaa atcttgttca tgcagctgtt 12 0 

aaaggcaatg gaatcgattc gtggaacatg 180 

tttcgtttaa gctctgctaa tggttttcag 240 

-tggggacata tcgegaaaaa tcagggttat 3JPIA. 

atcgcttggt ttgataaaaa cgcttatcag 3 60 

gtagctgata tccgtggaga cactgtcact 42 0 

ggccctgaaa gataccataa gcgtcaaatt 480 

tttaaagact tatcatctca gacaagtcat 540 

caagcttcat ttgacccctc tggaacttat 600 

caaaccagta tcgatagccc tgatcttgct 660 

gataaagtcg tgactgctgg aggttataca 72 0 

cgacgctata ttcccattaa agagcccgca 780 

ccttccatta aggtcggtga tactgttacc 840 

gttaataatt tgatcgttaa taaagaatta 900 

attgatccca caccattaga tgaaacagat 960 

ctccgtgtgg gtgaatattt taccgtcact 102 0 

ccaagtaatg gtatttatgt tcaaatcgga 1080 

gctaacaaat tatag 1125 



<213> OrganismName : Streptococcus pyogenes 
<400> PreSequenceString : 

50 atgggagaat cttattctgt tgaagcggtt ttgacagctg 
acattacaat cggcaatccg ttcaatcgaa ggcttggaaa 
tcggtgtctc aaaaagctag ttccatgttt aaatccatgt 
caagctatct cggcaatgac aaggacagtg tcatcaggcc 
atgaatagtt cagcgaaagc gtggaaaact tttgacgcta 

55 ggaaaaaaac aaattttggc agttaaaacg gcgatgcaag 
tactcggcat cagatatggc tagcacgtat gcacagttag 
accggaaagc tcgtaaaagc ttttggcggt ttagctgcat 
gctatgaaat caattagtca gcaaatgaca caagctgttg 
caagacttta ggataatgtt ggaacagacg cctgcaggga 

60 atgggtaaaa atcttgatga actcgtcgcc gatatccagg 
gattttttgg aagcggtaaa aaaagcaggc aatgataaga 
gagttcaaaa ctgttgacca agccatcgac ggtatgcgag 
caaccagcgt ttgaaaaagt gaaccaattt ggaattagag 
caactcgata aagttgattt ttctaagttt gctagtaatc 

65 attaatatcg ataaaattgt atctaatatt tcatcggcgg 
gttaaagaat tttgggacgg tttcaaacaa actggagcaa 
ttgcagagcg tttggggagc tttaaaaaat gtcgctagcg 



strain MGAS8232 




ttgataaaac 


ctttggcaaa 


60 


agcgttcaac 


cggtttttca 


120 


taggagcgaa 


tttagctgga 


180 


ttggctctat 


gcttggcgag 


240 


atttagcgga 


cattgggttt 


300 


actatgcaac 


taaaacaatc 


360 


cagcagttgg 


cgtgaaagat 


420 


ctgctgaaaa 


tccgaagcag 


480 


gaagaccaac 


agttgcatgg 


540 


tggctaaagt 


cgctaaatct 


600 


cgggtagggt 


taaaaccagc 


660 


gtttccaaaa 


gatggcaact 


720 


aaggcttatc 


caacaaattg 


780 


cgatcgaagc 


aatcggtaaa 


840 


ttgggaaatt 


ccttgaagga 


900 


tttcatctgt 


cacttcaaag 


960 


ttagtgcttt 


ttcaggagct 


1020 


ccatgagcgg 


agggaattgg 


1080 
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aagacttttg gagcaacagt tggagggatt gttaaacacg tctctaactt cgctaaagct 1140 
gtttccgatg ttttaggaaa gatggaccct ggcagactaa gaagtfcggat agctaccttc 12 00 
gccgcagtag ctggaggttt taagttattc gaaaaattaa cgggacaaag cgtcattggt 1260 
tcttttttgg ataaaattgg cagcaaattt ggtctctttg gaaacaaagc caaagaagga 132 0 
acagacaaag cctctaacgg cgctagaaga agcggtggca ttattagcca aatcttcagc 13 80 
ggcttgggta atatcgttaa gtctgctggt acagccatat caacagctgc aaaaggtatc 1440 
ggagttggta ttaaaactgc tttgtctgga atccccccct atcattag 1488 
<212> Type : DNA 
<211> Length : 1488 

SequenceName : SEQ ID 641 

SequenceDescription : 

Sequence 

<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 
atgaaaaaag gtttttttct catggttatg 
gataagtcag caaaccccaa acagcctacg 
ccaatgtatg cgatgacaaa agaagtatct 
tcaggtgcag gcattcattc ctttgaaccg 
gcggatttgt ttgtttacca ttcacatacc 
aatttaaaaa aatcaaaggt tgatgtgttt 
gtcaaagggc tagaagatat ggaagtcaca 
ccacatacct ggacggatcc cgttttagct 
ctaggacgtt tggatcctaa acacaaagac 
aaagaagcag agcaactaac tgaagaatac 
acattcgtga cgcaacacac ggcattttct 
cttggtatct cgggtatttc tccagagcaa 
caagactttg tcaaagaata caacgtcaag 
aaaattgctc atgctattgc gaaatcaaca 
gaagctgctc caagcggaaa caagacatat 
ctctatcaac agttgaagta a 

<212> Type : DNA 
<211> Length : 921 

SequenceName : SEQ ID 642 

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

gtggagaaaa agcaacgttt ttcccttaga aaatacaaat caggaacgtt ttcggtctta 60 

ataggaagcg tt'ttcttgat gatgacaaca acagtagcag cagatgagct aagcacaatg 120 

agtgaaccaa caatcacgaa tcacactcaa caacaagcgc aacatctcac caatacagag 18 0 

ttgagctcag ctgaatcaaa atctcaagac acatcacaaa tcactcccaa gacaaatcgt 240 

gaaaaagagc aaccacaagg tctagtctct gagccaacca caactgagct agctgacaca 3 00 

gatgcagcac caatggctaa tacaggtcct gatgcgactc aaaaaagcgc ttctttaccg 3 60 

ccagtcaata cagatgttca cgattgggta aaaaccaaag gagcttggga caagggatac 42 0 

aaaggacaag gcaaggttgt cgcagttatt gacacaggga tcgatccggc ccatcaaagc 480 

atgcgcatca gtgatgtatc aactgctaaa gtaaaatcaa aagaagacat gctagcacgc 540 

caaaaagccg ccggtattaa ttatgggagt tggataaatg ataaagttgt ttttgcacat 600 

aattatgtgg aaaatagcga taatatcaaa gaaaatcaat tcgaggattt tgatgaggac 660 

tgggaaaact ttgagtttga tgcagaggca gagccaaaag ccatcaaaaa acacaagatc 720 

tatcgtcccc aatcaaccca ggcaccgaaa gaaactgtta tcaaaacaga agaaacagat 78 0 

ggttcacatg atattgactg gacacaaaca gacgatgaca ccaaatacga gtcaca'cggt 840 

atgcatgtga caggtattgt agccggtaat agcaaagaag ccgctgctac tggagaacgc 900 

tttttaggaa ttgcaccaga ggcccaagtc atgttcatgc gtgtttttgc caacgacgtc 960 

atgggatcag ctgaatcact ctttatcaaa gctatcgaag atgccgtggc tttaggagca 1020 

gatgtgatca acctgagtct tggaaccgct aatggggcac agcttagtgg cagcaagcct 1080 

ctaatggaag caattgaaaa agctaaaaaa gccggtgtat cagttgttgt agcagcagga 1140 

aatgagcgcg tctatggatc tgaccatgat gatccattgg caataaatcc agactatggt 1200 

ttggtcggtt ctccctcaac aggtcgaaca ccaacatcag tggcagctat aaacagtaag 1260 

tgggtgattc aacgtctaat gacggtcaaa gaattagaaa accgtgccga tttaaaccat 1320 

ggtaaagcca tctattcaga gtctgtcgac tttaaaaaca taaaagatag cctaggttat 138 0 

gataaatcgc atcaatttgc ttatgtcaaa gagtcaactg atgcgggtta taaagcacaa 1440 

gacgttaaag ataaaattgc tttaattgaa cgtgatccca ataaaaccta tgacgaaatg 1500 

attgctttgg ctaagaaaca tggagccctg ggagtactta tttttaataa caagcctggt 1560 



gtcgtgagtt tagtaatgat 
caaggcatgt cagttgtaac 
ggagacctaa atgatgtgag 
tctgtaaatg atgtagcagc 
ttagaagctt gggcaaggga 
gaagcgtcaa aacctttgac 
caaggcattg accctgcgac 
ggtgaagaag ctgttaatat 
agttacacta aaaatgctaa 
actcaaaaat ttaaaaaggt 
tatctggcta aacgattcgg 
gagccctctc ctcgccaatt 
actatttttg cagaagacaa 
ggagctaaag taaagacatt 
ctagaaaatc ttagagcaaa 



agcagggtgt 6 0 

cagcttttac 12 0 

gatgatccaa 180 

tatttatgac 240 

tctagaccct 3 00 

actagataga 3 60 

actttatgac 42 0 

cgctaaagag 480 

ggctttcaaa 540 

gcgctcaaaa 60 0 

cttgaaacaa 660 

gaaagaaatt 72 0 

tgtcaatccc 780 

aagtccactt 840 

tttggaagtg 9 00 
921 
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caatcaaacc gctcaatgcg tctaacagct aatgggatgg ggataccatc tgctttcata 162 0 

tcgcacgaat ttggtaaggc catgtcccaa ttaaatggca atggtacagg aagtfctagag 1680 

tttgacagtg tggtctcaaa agcaccgagt caaaaaggca atgaaafcgaa tcatttttca 1740 

aattggggcc taacttctga tggctattta aaacctgaca ttactgcacc aggtggcgat 1800 

5 atctactcta cctataacga taaccactat ggtagccaaa caggaacaag tatggcctct 1860 

cctcagattg ctggcgccag ccttttggtc aaacaatacc tagaaaagac tcagccaaac 192 0 

ttgccaaaag aaaaaattgc tgatatcgtt aagaacctat tgatgagcaa tgctcaaatt 1980 

catgttaatc cagagacaaa aacgaccacc tcaccgcgtc agcaaggggc aggattactt 2040 

aatattgacg gagctgtcac tagcggactt tatgtgacag gaaaagacaa ctatggcagt 2100 

10 atatcattag gcaacatcac agatacgatg acgtttgatg tgacfcgttca caacctaagc 2160 

aataaagaca aaacattacg ttatgacaca gaattgctaa cagatcatgt agacccacaa 2220 

aagggccgct tcactttgac ttctcgctcc ttaaaaacgt accaaggagg agaagttaca 2280 

gtcccagcta atggaaaagt gactgtaagg gttaccatgg atgtctcaca gttcacaaaa 2340 

gagetaacaa aacagatgtc aaatggUtac tatctagaag gttttgtccg ctttagagat 2400 

15 agtcaagatg accaactaaa tagagtaaac attccttttg ttggttttaa aggacaattt .. 2460 

gaaaacttag cagttgcaga agagtccatt tacaggttaa aatctcaagg caaaaccggt 252 0 

ttttactttg atgaatcagg tccaaaagac gatatctatg tcggtaaaca ctttacagga 2580 

ctcgtcactc ttggttcaga gaccaatgtg tcaaccaaaa cgatttctga caatggtcta 2640 

cacacacttg gtacctttaa aaatgcagat ggcaaattta tcttagaaaa aaafcgcccaa 2700 

20 ggaaaccctg tcttagccat ttctccaaat ggtgataaca accaagattt tgcagccttc 2760 

aaaggtgttt tcttgagaaa atatcaaggc ttaaaagcaa gtgtctacca tgctagtgac 282 0 

aaggaacaca aaaatccact gtgggtcagc ccagaaagct ttaaaggaga taaaaacttt 2880 

aatagtgaca ttagatttgc aaaatcaacg accctgttag gcacagcgtt ttctggaaaa 2940 

tcgttaacag gagctgaatt accagatggg tattatcatt atgtagtgtc ttattaccca 3 0 00 

25 gatgtggtcg gtgccaaacg tcaagaaatg acatttgaca tgattttaga ccgacaaaaa 306 0 

ccggtactat cacaagcaac atttgatcct gaaacaaacc gattcaaacc agaaccccta 312 0 

aaagaccggg gattagctgg tgttcgcaaa gacagtgtct tttatctaga aagaaaagac 3180 

aacaagcctt atacagttac gataaacgat agctacaaat atgtctcagt agaagacaat 3240 

aaaacatttg tggagcgaca agctgatggc agctttatct tgccgcttga taaagcaaaa 33 0 0 

30 ttaggggatt tctattacat ggtcgaggat tttgcaggga acgtggccat cgctaagtta 3360 

ggagatcacc taccacaaac attaggtaaa acaccaatta aacttaagct tacagacggt 342 0 

aattatcaga ccaaagaaac gcttaaagat aatcttgaaa tgacacagtc tgacacaggt 3480 

ctagtcacaa atcaagccca gctagcagtg gtgcaccgca atcagccgca aagccagcta 3540 

acaaagatga atcaggattt ctttatctca ccaaacgaag atgggaataa agacttcgtg 3600 

35 gcctttaaag gcttgaaaaa taacgtatat aatgacttaa cggttaacgt atacgctaaa 3660 

gatgaccacc aaaaacaaac ccctatctgg tctagtcaag caggcgctag tgcatcagct 3720 

attgaaagta cagcctggta tggcataaca gcccgaggaa gcaaggtgat gccaggtgat 3780 

tatcagtatg ttgtgactta tcgtgacgaa catggtaaag aacatcaaaa gcagtacacc 3 840 

atatctgtga atgacaaaaa accaatgatc actcagggac gttttgatac cattaatggc 390 0 

40 gttgaccact ttactcctga caagacaaaa gcccttggct catcaggcat tgtccgcgaa 3960 

gaagtctttt acttggccaa gaaaaatggc cgtaaatttg atgtgacaga aggtaaagat 402 0 

ggtatcacag ttagtgacaa taagatgtat atccctaaaa atccagatgg ttcttacacc 4080 

atttcaaaaa gagatggtgt cacactgtca gattattact accttgtcga agatagagct 4140 

ggtaatgtgt cttttgctac cttgcgtgac ctaaaagcgg tcggaaaaga caaagcagta 42 00 

45 gttaactttg gattagactt accggtccct gaagacaaac aaatagtgaa ctttacttac 4260 

cttgtgcggg atgcagatgg taaaccgatt gaaaacctag agtattataa taactcaggt 4320 

aacagtctta tcttgccata cggcaaatac acggtcgaat tgttgaccta tgacaccaat 4380 

gcagccaaac tagagtcaga taaaatcgtt tcctttacct tatcagctga taataacttc 4440 

caacaagtta cctttaagat gacgatgtta gcaacctctc aaataactgc ccactttgat 4500 

50 catcttttgc cagaaggcag tcgcgttagc cttaaaacag ctcaaggtca gctgatcccg 4560 

cttgaacagt ccttgtatgt gcctaaagct tatggcaaaa ccgttcaaga aggcacttac 4620 

gaagttgttg tcagcctgcc taaaggctac cgtatcgaag gcaacacaaa ggtgaatacc 4680 

ctaccaaatg aagtgcacga actatcatta cgccttgtca aagtaggaga tgcctcagat 4740 

tcaactggcg atcataaggt tatgtcaaaa aataattcac aggctttgac agcctttgcc 4800 

55 acaccaacca agacaacgac ctcagcaaca gcaaaagccc taccatcagc gggtgaaaaa 4860 

atgggtctca agttgcgcat agtaggtctt gtgttactcg gacttacttg cgtctttagc 4920 

cgaaaaaaat caaccaaaga ttga 4944 
<212> Type : DNA 
<211> Length : 4944 

60 SequenceName : SEQ ID 643 

SequenceDescription : 

Sequence 



65 <213> OrganisinName : Treponema pallidum subsp. pallidum str. Nichols 
<400> PreSequenceString : 

atgatgcggt cacttttttc aggtgtgtct ggtatgcaga atcatcaaac gcgcatggat 



60 



WO 2005/076010 



PCT/IN2005/000037 



276/341 



gtcattggga acaacgtcgc gaacgttaac 
tttcaagatc ttatttctca gcaactgagt 
ggagtgaatc ccaaggaagt gggattgggc 
acgcaaggtg cactgcaaac gacgggtatc 
5 ttttttgtgc tgaaaagtgg ggaaaagacg 
gataatgcgg gcactctcgt gaaccctgcg 
caggacgtgg cgggggagcg tttaattaat 
cccattgggc aaaagataga tgcgcagcag 
gacaagcgtc tgcctgagct tgctgcagat 

10 tggacaactg actttcaagt gtatgatagc 
ttttcgcgtg tgccggggac gaacaatcag 
acagaggtag atacgcaaac gcgtgtaggg 
tttattgtaa attttgataa ttttggacac 
■ -gtgaccggtc ctaccggaca ggfcgctcctt 

15 ccggacgatg cagggcaggt tacgcgccac 
accgcgcgca atacgattac gcagtttgct 
gacggttacg cgatgggata tttggaaaat 
ggtgtgtatt caaatggggt gagccaagac 
aatcaaggtg gtcttgagaa ggcaggagag 

20 atagcgaaca ttagcacgtc gggggtgatg 
gagatgagca acgtagattt aaccgatcaa 
tttcaggcgg gcgcaaagac gattcagaca 
ttgaagcgct ga 
<212> Type : DNA 

25 <211> Length : 1392 

SequenceName : SEQ ID 644 
SequenceDe script ion : 



actaccggtt ttaagcgtgg gcgtgttaat 120 

gcggctgcgc gtccgaatga agaagttgga 180 

gtgctgattg caagcatcga tactgttcac 240 

aatacggatg tgtctattca ggggagtggt 300 

tttttcaccc gcgcaggtgc ctttggggtt 3 60 

aatggtatgc gcgttcaagg ttggatggcg 420 

tcctctgcac agacgcagga tctcgttatc 480 

accagcactg ttcactatgc ctgtaattta 540 

gcgaacgaag cggacgtgcg taagtccacg 600 

ttcgggcagc agcatacgtt gcagattaac 660 

tggcaggcca ctgtcgcagt ggatccgggg 720 

gtggggacat ctgacggtgc ggcaaacacc 780 

ctcgcttcag tgactgacac tgcagggaac 840 

gaagcgtcgt acgatgttgt cggtgcgaat 900 

gctttcacgc tcaacttggg tgaaattggc 960 

gaacgcagta ctaccaaagc ctaccggcag 1020 

tttaaaatag atcaaagcgg tgtcatcact 10 80 

attggccagc tcgcacttgc aggatttgca 1140 

aacacctacg tacaatcgaa caactcaggg 1200 

gggaagggaa agttgattgc agggacactt 12 60 

tttacggata tgatcattac ccaaaaaggg 1320 

tcagacacca tgttggatac ggtgttgagt 13 80 

1392 



30 



Sequence 



<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols 
<40 0> PreSequenceString : 

atggggtgca tgcggtgggg gagtgtgctg tgtgtggtgg tgggggtagg agcgagcggg 60 

ggagtgctcg gacaggagtt ttccccgaag ctaactggct ctgccacact tgagtggggc 120 

35 atcagctatg gcaagggggt aggcagtcat ggccaggccc ctggtgcagt tatgggcacc 180 

ggtccctaca atctgaagca cgggtttcgt actaccaaca cggtgggagt atcctttccc 240 

ctggttatgc gcaccaccca cacgcgccgt gggcagcacc cggcactgta tgcggagctg 3 00 

aaggtggcgg acctgcaggc ggacctgagt caggggaagg caggttttgc cgttaagcgc 3 60 

aaggggaagg tagaggcgac actacactgt tatggggcct acctgacgat tgggaagaac 42 0 

40 cccacgtttc tgacgaactt tgcccggctg tggaagccgt gggtgacagc gcagtaccag 480 

gaggatgcgg tacagtatgc gccggggttt gggggtttag gcggcaaggt tgggtatcgg 540 

gcacaggaca ttgggggcag tggggtcagc cttgatgtgg ggtttctctc ctttgcctct 60 0 

aacggtgcct gggatagtac tgaccccacg cacagtaagt atggctttgg ggcagacttg 660 

aagctaatgt atgcgcgtgc aggacaccct ctgtgcacgg tagagcttgc cagcaatgtt 720 

45 acgctagaag acggatacct catcggtgca cagaaggacg caaacaatca gaacaaggat 78 0 

aaactgctgt ggaatgtagg gggccgactc accctcgaac caggcgccgg cttccgcttc 840 

tccttcgccc tcgacgccgg taaccaacac cagagtgcac aggactttca aaatcgcaca 900 

cagagggcgc agagtgaact caccgccctc tcaaataacc tcttccaggg agaaagtcaa 960 

aaacaggaag cctgggtaac ccaggtagtg caacaggcga cgcagacagt aacggctgga 1020 

50 gttcgaagcg cgctggaatc tcgggggact acgtacataa acgcgctaga ggcagttcag 1080 

cctaatcctg ctaaacctac cggtaaggtt gtgcaaaatc ttcacacccc gcagggaagt 114 0 

ccgccgaacc tgccgccgct tcctgcactt cctgcatttt ccctgatggg gcaggttttg 120 0 

ctgcagtacg atgcggagca ggtggtgaag gggtttgagc aggtacagac gcaaatcgtc 12 60 

actgaaatta atcagaaagt gcaagcggct gtggcaaaaa ataatgcaaa catgcaagcg 1320 

55 gtcgggggta gtctaggcga tactgcgaga atggtaggcg aagcgctcat taagcagcaa 13 8 0 

ctatcacgta agcagaacag cattctgacc atggtgagcg tgcaagatga ggtgaaacag 144 0 

gatctggcag atttagtgcc gatgatgcga acggaaataa cggcgttttt cgcgagtgtc 1500 

cagcaacaca taaccgaaga agtgaagaag aagacggatg cgttgaatgc ggggcagcag 1560 

atacgtcagg ctatacagaa cctgcgtgcg tctgcatggc gtgcctttct aatgggagtc 1620 

60 agcgccgtgt gtctgtatct tgacacctac aatgtcgcct tcgatgcgct gtttacggcg 168 0 

cagtggaagt ggctgtcttc tggcatatac tttgccacag caccggcaaa cgtttttggc 1740 

accagggtgt tagataacac catcgcaagc tgtggcgact ttgccggatt ccttaagctc 18 0 0 

gaaactaaga gcggtgaccc ctacacccac ctgctcaccg gcctggacgc cggcgttgaa 1860 

acacgcgtgt acatccccct cacccatgac ctgtacaaaa ataataacgg gaaccctctc 192 0 

65 ccttccggcg gttcctcagg gcacattggc ctgccggtgg tggggaaggc gtggtgtagc 1980 

tatcgcatcc cggtgcagga ttacggctgg gtgaagccaa gcgttacggt ccatgcctct 2040 

accaaccgtg cacacctgaa tgcccctgct gcaggtggag cagtaggagc tacctatcta 210 0 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



accaaggagt actgtgcaca gctgcgtgct ggtatttcag ccagtctcat agagaagacg 
gtattctccc ttgattggga acagggtatg ctctctgatg tcccgtacct gctggtgtcc 
gagtgcctca cccagggaat cggccgcatc gtgtgcggcg tcaccctctc ctggtag 

<212> Type : DNA 
<211> Length : 2277 

SequenceName : SEQ ID 645 

SequenceDescription : 



2160 
2220 
2277 



Sequence 



pallidum str. Nichols 



agtatggctt 
cgttggagct 
ccaagaacga 
caggcgccgg 



<213> OrganismName : Treponema pallidum subsp 
<400> PreSequenceString : 

9tgggcaggc aggtgatgca agcgggggta cttgcgggca tggtatgtgc> vtgcttctggt 
tatgcaggcg tactcactcc gcaggtcagt ggcacagccc agctccagtg gggcattgcg 
ttccagaaga atccacgcac tggcccgggc aagcacaccc atgggtttcg cactaccaat 
tttccctgcc gttggtgtca aagcacaccc acacccgccg aggggaggca 
tgtgggcaca gctgcagctg aaggacctgg cagtagagct tgcgtcttct 
cggccctgtc ctttaccaaa cctaccgctt ccttccaggc aaccctgcac 
cctacctgac agtgggtacc agtccttcct gtgtggttaa ctttgcccag 
cctttgtcac ccgtgcctat tcagaaaagg acactcgcta tgcccctggt 
ccggggcaaa actcggctac caggcccaca atgtgggaaa cagcggagta 
tcggtttcct ctccttcctt tccaatggtg cctgggatag tactgacacc 
cggggccgat gcaacgcttt cctatggcgt cgaccgtcag 
ggcagggaat gccacactgg accagaacta cgttaagggt 
aaacaaaaca gcactcctgt ggggagtagg aggccgactc 
cttccgcttc tccttcgccc tcgacgccgg taaccaacac 
cacatgctca gacccaagag agagctatcc tcaaagcaag ggaagtgttt 
cgtgcagaac cttcccaata tcatgatgcc accaggaatc 
agagatggta ggacttgctg ctttgattgc agaaggaacg 
cgtgctagcc gctggcgcgc tcgcggcgct tgtatcgcaa 
cttgtaccga acatagagca aggagtacgt gatgtcttcc gctcttccga tccaagagtt 
gtcactgcta aacttctcgc tttccttgag cgcgcaccta tgaacgcgct caacatagac 
gcgctcctgc gtatgcagtg gaagtggctc tcttctggca tatactttgc caccgcaggc 
actaatatct ttggcaaacg cgtctttgct accactcgtg cgcactactt tgattttgcc 
ggattcctta agctcgaaac caaaagcggt gacccctaca cccacctgct caccggcctg 
aacgccggcg tcgaagcacg cgtgtacatc cccctcacct acatccgtta cagaaataac 
ggagggtacg aactgaatgg agctgtgccc cctgggacta tcaatatgcc aattttgggg 
aaggcgtggt gcagctatcg catccccctc ggttcccacg cctggcttgc accacacaca 
tccgtgctcg gcacaaccaa tcgctttaac attattaacc ccgcgggcaa cctgttgaat 
gaacgagcgc tccagtacca ggtgggactg acgttcagtc ccttcgagaa ggtggagctc 
agcgcccagt gggaacaggg cgtgcttgct gacgctcctt acatgggcat tgccgagagc 
atctggtccg aacgccactt cggcaccctt gtctgcggaa tgaaagtgac atggtaa 

<212> Type : DNA 
<211> Length : 1797 

SequenceName : SEQ ID 646 

SequenceDescription : 



agtctgacta 
cgctcagggg 
aaaagctcaa 
tgttatgggg 
ctgtggaaac 
ttctccggct 
gatgtggaca 
acgcacagca 
cggctgctta 
accgaagact 
accctcgaac 
cagagtaacg 
agacgggtgg aggggaaact 
accgaacaaa ccactctcat 
ctcggcagcg ccattcaaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1797 



Sequence 

<213> OrganismName : Treponema pallidum subsp. 
<4 00> PreSequenceString : 
gtgggcaggc aggtgatgca agcgggggta 
tactcactcc gcaggtcagt 
atccacgcac tggcccgggc 
tttccctgcc gttggtgtca 
tgtgggcaca gctgcagctg 
cggccctgtc ctttaccaaa 
cctacctgac agtgggtacc 
cctttgtcac ccgtgcctat 
ccggggcaaa actcggctac 
tcggtttcct ctccttcctt 
agtatggctt cggggccgat 
cgttggagct ggcagggaat 



pallidum str. Nichols 



tatgcaggcg 
ttccagaaga 
agtctgacta 
cgctcagggg 
aaaagctcaa 
tgttatgggg 
ctgtggaaac 
ttctccggct 
gatgtggaca 
acgcacagca 
cggctgctta 
accgaagact 
accctcgaac 



cttgcgggca tggtatgtgc 
ggcacagccc agctccagtg 
aagcacaccc atgggtttcg 
aagcacaccc acacccgccg 
aaggacctgg cagtagagct 
cctaccgctt ccttccaggc 
agtccttcct gtgtggttaa 
tcagaaaagg acactcgcta 
caggcccaca atgtgggaaa 
tccaatggtg cctgggatag 
gcaacgcttt cctatggcgt 



tgcttctggt 
gggcattgcg 
cactaccaat 
aggggaggca 
tgcgtcttct 
aaccctgcac 
ctttgcccag 
tgcccctggt 
cagcggagta 
tactgacacc 
cgaccgtcag 
cgttaagggt 



gccacactgg accagaacta 
ccaagaacga aaacaaaaca gcactcctgt ggggagtagg aggccgactc 
caggcgccgg cttccgcttc tccttcgccc tcgacgccgg taaccaacac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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cagagtaacg 
agacgggtgg 
accgaacaaa 
ctcggcagcg 
cttgtaccga 
gtcactgcta 
gcgctcctgc 
actaatatct 
ggattcctta 
aacgccggcg 
ggagggtacg 
aaggcgtggt 
tccgtgctcg 
gaacgagcgc 
agcgcccagt 
atctggtccg 



cacatgctca 
aggggaaact 
ccactctcat 
ccattcaaac 
acatagagca 
aacttctcgc 
gtatgcagtg 
ttggcaaacg 
agctcgaaac 
tcgaagcacg 
aactgaatgg 
gcagctatcg 
gcacaaccaa 
tccagtacca 
gggaacaggg 
aacgccactt 



gacccaagag agagctatcc tcaaagcaag 
cgtgcagaac cttcccaata tcatgatgcc 
agagatggta ggacttgctg ctttgattgc 
cgtgctagcc gctggcgcgc tcgcggcgct 
aggagtacgt gatgtcttcc gctctfcccga 
tttccttgag cgcgcaccta tgaacgcgct 
gaagtggctc tcttctggca tatactttgc 
cgtctttgct accactcgtg cgcactactt 
caaaagcggt gacccctaca cccacctgct 
cgtgtacatc cccctcacct acatccgtta 
agctgtgccc cctgggacta tcaatatgcc 
catccccctc ggttcccacg cctggcttgc 
tcgctttaac attattaacc ccgcgggcaa 
ggtgggactg acgttcagtc ccttcgagaa 
cgtgcttgct gacgctcctt acatgggcat 
cggcaccctt gtctgcggaa tgaaagtgac 



ggaagtgttt 
accaggaatc 
agaaggaacg 
tgtatcgcaa 
tccaagagtt 
caacatagac 
caccgcaggc 
tgattttgcc 
caccggcctg 
cagaaataac 
aattttgggg 
accacacaca 
cctgttgaat 
ggtggagctc 
tgccgagagc 
atggtaa 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
174Q; 
1797 



<212> Type : DNA 

<211> Length : 1797 

SequenceName : SEQ ID 647 
SequenceDescription : 

Sequence 



<213> OrganistnName : SARS coronavirus Frankfurt 1 
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttafcttctt 180 

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 3 00 

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 3 60 

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 42 0 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 90 0 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020 

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 10 80 

ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140 

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 12 0 0 

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 12 60 

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 13 2 0 

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 13 80 

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620 

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680 

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740 

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 18 00 

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 18 60 

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 192 0 

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 19 80 

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220 

ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 22 80 

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340 

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400 

gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460 
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gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 252 0 

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgcfcctagtt 258 0 

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640 

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700 

5 aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760 

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2 820 

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880 

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940 

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 30 00 

10 agggcttctg cfcaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060 

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120 

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180 

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ggcaettctfc ggtttattac acagaggaac, ttcttttctc cacaaataat tactacagac 3300 

15 aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360 

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420 

catacatcac cagatgttga ttttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480 

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540 

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3 600 

20 ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660 

agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720 

gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768 
<212> Type : DNA 
<211> Length : 3768 

25 SequenceName : SEQ ID 648 

SequenceDescription : 

Sequence 



30 <213> OrganismName : SARS coronavirus HSR 1 
<400> PreSequenceString : 

atgtttafctt tcttattatt tcttactctc actagtggfca gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt . 120 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180 

35 ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 3 00 

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 3 60 

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

40 ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 72 0 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

45 acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020 

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 108 0 

50 ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140 

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200 

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 12 60 

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 132 0 

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 13 80 

55 tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 15 00 

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620 

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 168 0 

60 tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740 

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800 

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860 

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 192 0 

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980 

65 catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 
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aatatgtaca tctgcggaga ttctactgaa 
ttttgcacac aactaaatcg tgcactctca 
cgtgaagtgt tcgctcaagt caaacaaatg 
ggttttaatt tttcacaaat attacctgac 
5 gaggacttgc tctttaataa ggtgacactc 
gaatgcctag gtgatattaa tgctagagat 
acagtgttgc cacctctgct cactgatgat 
agtggtactg ccactgctgg atggacattt 
gctatgcaaa tggcatatag gttcaatggc 

10 aaccaaaaac aaatcgccaa ccaatttaac 
acaacaacat caactgcatt gggcaagctg 
ttaaacacac ttgttaaaca acttagctct 
gatatccttt cgcgacttga taaagtcgag 
— -ggcagacttc- aaagccttca aacctatgta 

15 agggcttctg ctaatcttgc tgctactaaa 
agagttgact tttgtggaaa gggctaccac 
ggtgttgtct tcctacatgt cacgtatgtg 
ccagcaattt gtcatgaagg caaagcatac 
ggcacttctt ggtttattac acagaggaac 

20 aatacatttg tctcaggaaa ttgtgatgtc 
gatcctctgc aacctgagct tgactcattc 
catacatcac cagatgttga tcttggcgac 
attcaaaaag aaattgaccg cctcaatgag 
gaccttcaag aattgggaaa atatgagcaa 

25 ggcttcattg ctggactaat tgccatcgtc 
agttgttgca gttgcctcaa gggtgcatgc 
gatgactctg agccagttct caagggtgtc 
<212> Type : DNA 
<211> Length. : 3768 

30 SequenceName : SEQ ID 649 

SequenceDescription : 



tgtgctaatt tgcttctcca atatggtagc 222 0 

ggtattgctg ctgaacagga tcgcaacaca 228 0 

tacaaaaccc caactttgaa atattttggt 2340 

cctctaaagc caactaagag gtcttttatt 240 0 

gctgatgctg gcttcatgaa gcaatatggc 2460 

ctcatttgtg cgcagaagtt caatggactt 252 0 

atgattgctg cctacactgc tgctctagtt 2580 

ggtgctggcg ctgctcttca aatacctttt 2640 

attggagtta cccaaaatgt tctctatgag 2700 

aaggcgatta gtcaaattca agaatcactt 2760 

caagacgttg ttaaccagaa tgctcaagca 2820 

aattttggtg caatttcaag tgtgcfcaaat 288 0 

gcggaggtac aaattgacag gttaattaca 2940 

acacaacaac taateagggc tgctgaaatq 3000 

atgtctgagt gtgttcttgg acaatcaaaa 3 060 

cttatgtcct tcccacaagc agccccgcat 3120 

ccatcccagg agaggaactt caccacagcg 3180 

ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ttcttttctc cacaaataat tactacagac 33 00 

gttattggca tcattaacaa cacagtttat ' 3360 

aaagaagagc tggacaagta cttcaaaaat 3420 

atttcaggca ttaacgcttc tgtcgtcaac 3480 

gtcgctaaaa atttaaatga atcactcatt 3 540 

tatattaaat ggccttggta tgtttggctc 3 600 

atggttacaa tcttgctttg ttgcatgact 3 660 

tcttgtggtt cttgctgcaa gtttgatgag 372 0 

aaattacatt acacataa 3 768 



Sequence 



35 <213> Organi smName : SARS coronavirus 20*01 
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120 

tactatcctg atgaaatfctt tagatcagac actctttatt taactcagga tttatttctt 180 

40 ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 24 0 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300 

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360 

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

45 ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

50 acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 102 0 

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080 

55 ttttcaacct ttaagtgtta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140 

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 12 00 

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 12 60 

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 132 0 

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 13 80 

60 tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620 

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680 

65 tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740 

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800 

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860 
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atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920 

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 19 8 0 

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

5 tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220 

ttttgcacac aactaaatcg tgcactctca ggtattgctg cfcgaacagga tcgcaacaca 22 80 

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340 

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 240 0 

10 gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460 

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520 

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 258 0 

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640 

gctatgcaaa tggcatatag gttcaatggc attggegtta cccaaaatg.t tctcta£gag 2700 

15 aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760 

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 282 0 

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 288 0 

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 294 0 

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 300 0 

20 agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3 060 

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 312 0 

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180 

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 33 00 

25 aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 33 60 

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 342 0 

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 348 0 

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540 

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600 

30 ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3 660 

agttgfctgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720 

gatgactcfcg agccagttct caagggtgtc aaattacatt acacataa 3 768 
<212> Type : DNA 
<211> Length : 3768 

35 SeguenceName : SBQ ID 650 

SequenceDescription : 

Sequence 



40 <213> OrganismName : SARS coronavirus TW1 
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 12 0 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180 

45 ccattfctatt ctaatgttac agggt-ttcat actattaatc atacgtttgg caaccctgtc 240 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 3 00 

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 3 60 

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

50 ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat . 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

55 acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 102 0 

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 10 80 

60 ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140 

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 12 00 

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 12 60 

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 13 20 

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 13 80 

65 tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc atfcaaatgat 1440 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 



WO 2005/076010 



PCT/IN2005/000037 



282/341 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620 

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 168 0 

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740 

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 18 00 

5 gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc i8 60 

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtctfcat aggagctgag 192 0 

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980 

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

10 tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220 

fctttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 22 8 0 

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340 

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag ,gtcttttatt 2400 

15 gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 24 60 

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520 

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 25 80 

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2 640 

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 27 00 

20 aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 27 60 

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820 

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 28 80 

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940 

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3 0 00 

25 agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3 060 

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120 

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180 

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3 300 

30 aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3 3 60 

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 3420 

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480 

attcaaaaag aaafctgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3 540, 

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3 600- 

35 ggcttcattg ctggactaafc tgccatcgtc atggttacaa tctfcgctttg ttgcatgact 3660 

agttgttgca gttgcctcaa gggtgcatgc fccttgtggtt cttgctgcaa gtttgatgag 3 720 

gatgactctg agccagttcfc caagggtgtc aaattacatt acacataa 3768 
<212> Type : DNA 
<211> Length : 3 768 

40 SequenceName : SEQ ID 651 

SequenceDescription : 

Sequence 



45 <213> OrganistnName : SARS coronavirus CUHK-SulO 
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 

50 ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 

55 ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 

60 acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 
gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 

65 ttttcaacct ttaagtgcta tggcgtttct gccactaagt fcgaatgatct ttgcttctcc 

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 



60 
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cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa 
aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa 
tcccctgatg gcaaaccttg caccccacct gctcttaatt gttafctggcc 
tatggttttt acaccactac tggcattggc taccaacctt acagagttgt 
5 tttgaacfctt taaatgcacc ggccacggtt tgtggaccaa aattatccac 
aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt 
tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga 
tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgctc 
gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct 

10 gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc 
atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat 
cafcgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg 
catacagttt ctttattacg tagtactagc caaaaatcta ttgtggcfcta 
ttaggtgctg atagttcaat- tgcttactct aataacacca ttgctatacc 

15 tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc 
aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca 
ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga 
cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa 
ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag 

20 gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa 
gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt 
acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc 
agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca 
gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt 

25 aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca 
acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa 
ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag 
gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag 
ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc 

30 agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg 
agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc 
ggtgttgtct tcctacatgt cacgtatgtg ccafccccagg agaggaactt 
ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt 
ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat 

35 aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa 
gatcctctgc aacctgagcfc fcgactcattc aaagaagagc tggacaagta 
catacatcac cagatgttga tctfcggcgac atttcaggca ttaacgcttc 
attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga 
gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta 

40 ggcttcafctg ctggactaat tgccatcgtc atggttacaa tcttgctttg 
agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa 
gatgactctg agccagttct caagggtgtc aaattacatt acacataa 
<212> Type : DNA 
<211> Length : 3768 

45 SequenceName : SEQ ID 652 

SequenceDescription : 



ttataaatat 
tgtgcctttc" 
attaaatgat 
agtactttct 
tgaccttatt 
gttaactcct 
tttcactgat 
ttttgggggt 
atatcaagat 
agcttggcgc 
aggagctgag 
tgctagttac 
tactatgtct 
tactaaettt 
cgtagattgt 
atatggtagc 
tcgcaacaca 
atattttggt 
gtcttttatt 
gcaatatggc 
caatggactt 
tgctcfcagtt 
aatacctttt 
tctctatgag 
agaatcactt 
tgcfccaagca 
tgtgctaaat 
gttaattaca 
tgctgaaatc 
acaatcaaaa 
agccccgcat 
caccacagcg 
fcgfcgtttaat 
tactacagac 
cacagtttat 
cttcaaaaat 
tgtcgtcaac 
atcactcatt 
tgtttggctc 
ttgcatgact 
gtttgatgag 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 

iaeo 

1920 
1980 
2040 
-2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3768 



Sequence 



50 <213> OrganismName : SARS coronavirus Urbani 
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga 

55 ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt 

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat 

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt 

60 ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt 

65 acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa 

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc 



ccggtgcacc 
gaggggggtt 
tttatttctt 
caaccctgtc 
tgtccgtggt 
taacaattct 
ctttgcfcgtt 
taattgcact 
taattttaaa 
taagggctat 
acctattttt 
cttttcacct 
aaagccaact 
ttgttctcaa 
aggaatttac 
taatattaca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



WO 2005/076010 



PCT/IN2005/000037 



284/341 



aacttgtgtc cttttggaga ggtttttaat 
gagagaaaaa aaatttctaa ttgtgttgct 
ttttcaacct ttaagtgcta tggcgtttct 
aatgtctatg cagattcttt tgtagtcaag 
5 caaactggtg ttattgctga ttataattat 
cttgcttgga atactaggaa cattgatgct 
aggtatctta gacatggcaa gcttaggccc 
tcccctgatg gcaaaccttg caccccacct 
tatggttttt acaccactac tggcattggc 

10 tttgaacttt taaatgcacc ggccacggtt 
aagaaccagt gtgtcaattt taattttaat 
tcttcaaaga gatttcaacc atttcaacaa 
tccgttcgag atcctaaaac atctgaaata 
gtaagtgtaa ttacacctgg aacaaatgct 

15 gttaactgca ctgatgtttc tacagcaatt 
atatattcta ctggaaacaa tgtattccag 
catgtcgaca cttcttatga gtgcgacatt 
catacagttt ctttattacg tagtactagc 
ttaggtgctg atagttcaat tgcttactct 

20 tcaattagca ttactacaga agtaatgcct 
aatatgtaca tctgcggaga ttctactgaa 
ttttgcacac aactaaatcg tgcactctca 
cgtgaagtgt tcgctcaagt caaacaaatg 
ggttttaatt tttcacaaat attacctgac 

25 gaggacttgc tctttaataa ggtgacactc 
gaatgcctag gtgatattaa tgctagagat 
acagtgttgc cacctctgct cactgatgat 
agtggtactg ccactgctgg atggacattt 
gctatgcaaa tggcatatag gttcaatggc 

30 aaccaaaaac aaatcgccaa ccaatttaac 
acaacaacat caactgcatt gggcaagctg 
ttaaacacac ttgttaaaca acttagctct 
gatatccttt cgcgacttga taaagtcgag 
ggcagacttc aaagccttca aacctatgta 

35 agggcttctg ctaatcttgc tgctactaaa 
agagttgact tttgtggaaa gggctaccac 
ggtgttgtct tcctacatgt cacgtatgtg 
ccagcaattt gtcatgaagg caaagcatac 
ggcacttctt ggtttattac acagaggaac 

40 aatacatttg tctcaggaaa ttgtgatgtc 
gatcctctgc aacctgagct cgactcattc 
catacatcac cagatgttga tcttggcgac 
attcaaaaag aaattgaccg cctcaatgag 
gaccttcaag aattgggaaa atatgagcaa 

45 ggcttcattg ctggactaat tgccatcgtc 
agttgttgca gttgcctcaa gggtgcatgc 
gatgactctg agccagttct caagggtgtc 
<212> Type : DNA 
<211> Length : 3768 

50 SeguenceNarae : SEQ ID 653 

SequenceDe script ion : 



gctactaaat tcccttctgt ccacgcaugg jiuav 

gattactctg tgctctacaa ctcaacattt 1080 

gccactaagt tgaatgatct ttgcttctcc 1140 

ggagatgatg taagacaaat agcgccagga 1200 

aaattgccag atgatttcat gggttgtgtc 1260 

acttcaactg gtaattataa ttataaatat 1320 

tttgagagag acatatctaa tgtgcctttc 13 8 0 

gctcttaatt gttattggcc attaaatgat 1440 

taccaacctt acagagttgt agtactttct 1500 

tgtggaccaa aattatccac tgaccttatt 1560 

ggactcactg gtactggtgt gttaactcct 1620 

tttggccgtg atgtttctga tttcactgat 1680 

ttagacattt caccttgctc ttttgggggt 1740 

tcatctgaag, ttgcfcgttct atatcaagat 18-00 

catgcagatc aactcacacc agcttggcgc 18 60 

actcaagcag gctgtcttat aggagctgag 1920 

cctattggag ctggcatttg tgctagttac 1980 

caaaaatcta ttgtggctta tactatgtct 2040 

aataacacca ttgctatacc tactaacttt 2100 

gtttctatgg ctaaaacctc cgtagattgt 2160 

tgtgctaatt tgcttctcca atatggtagc 2220 

ggtattgctg ctgaacagga tcgcaacaca 22 80 

tacaaaaccc caactttgaa atattttggt 2340 

cctctaaagc caactaagag gtcttttatt 2400 

gctgatgctg gcttcatgaa gcaatatggc 2460 

ctcatttgtg cgcagaagtt caatggactt 2520 

atgattgctg cctacactgc tgctctagtt 2580 

ggtgctggcg ctgctcttca aatacctttt 2640 

attggagtta cccaaaatgt tctctatgag 2700 

aaggcgatta gtcaaattca agaatcactt 2760 

caagacgttg ttaaccagaa tgctcaagca 2 820 

aattttggtg caatttcaag tgtgctaaat 2880 

gcggaggtac aaattgacag gttaattaca 2940 

acacaacaac taatcagggc tgctgaaatc 3 000 

atgtctgagt gtgttcttgg acaatcaaaa 3060 

cttatgtcct tcccacaagc agccccgcat 3120 

ccatcccagg agaggaactt caccacagcg 3180 

ttccctcgtg aaggtgttttr tgtgtttaat 3240 

ttcttttctc cacaaataat tactacagac 3300 

gttattggca tcattaacaa cacagtttat 3360 

aaagaagagc tggacaagta cttcaaaaat 3420 

atttcaggca ttaacgcttc tgtcgtcaac 3480 

gtcgctaaaa atttaaatga atcactcatt 3540 

tatattaaat ggccttggta tgtttggctc 3600 

atggttacaa tcttgctttg ttgcatgact 366 0 

tcttgtggtt cttgctgcaa gtttgatgag 3720 

aaattacatt acacataa 3768 



Sequence 



55 <213> Organ! smName : SARS corona 
<400> PreSequenceString : 
atgtttattt tcttattatt tcttactctc 
acttttgatg atgttcaagc tcctaattac 
tactatcctg atgaaatttt tagatcagac 

60 ccattttatt ctaatgttac agggtttcat 
atacctttta aggatggtat ttattttgct 
tgggtttttg gttctaccat gaacaacaag 
actaatgttg ttatacgagc atgtaacttt 
tctaaaccca tgggtacaca gacacatact 

65 ttcgagtaca tatctgatgc cttttcgctt 
cacttacgag agtttgtgtt taaaaataaa 
caacctatag atgtagttcg tgatctacct 



actagtggta gtgaccttga ccggtgcacc 60 

actcaacata cttcatctat gaggggggtt 120 

actctttatt taactcagga tttatttctt 180 

actattaatc atacgtttgg caaccctgtc 240 

gccacagaga aatcaaatgt tgtccgtggt 3 00 

tcacagtcgg tgattattat taacaattct 360 

gaafctgtgtg acaacccttt ctttgctgtt 420 

atgatattcg ataatgcatt taattgcact 480 

gatgtttcag aaaagtcagg taattttaaa 540 

gatgggtttc tctatgttta taagggctat 600 

tctggtttta acactttgaa acctattttt 660 
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aagttgcctc ttggtattaa cattacaaat 
gctcaagaca tttggggcac gtcagctgca 
acatttatgc tcaagtatga tgaaaatggt 
aatccacttg ctgaactcaa atgctctgtt 
5 cagacctcta atttcagggt tgttccctca 
aacttgtgtc cttttggaga ggtttttaat 
gagagaaaaa aaatttctaa ttgtgttgct 
ttttcaacct ttaagtgcta tggcgtttct 
aatgtctatg cagattcttt tgtagtcaag 

10 caaactggtg ttattgctga ttataattat 
cttgcttgga atacfcaggaa cattgatgct 
aggtatctta gacatggcaa gctfcaggccc 
tcccctgatg gcaaaccttg caccccacct 
tatggttttt a.caccactac tggcattggc 

15 tttgaacttt taaatgcacc ggccacggtt 
aagaaccagt gtgtcaattt taattttaat 
tcttcaaaga gatttcaacc atttcaacaa 
tccgttcgag atcctaaaac atctgaaata 
gtaagtgtaa ttacacctgg aacaaatgct 

20 gttaacfcgca ctgatgtttc tacagcaatt 
atatattcta ctggaaacaa tgtattccag 
catgtcgaca cttcttatga gtgcgacatt 
catacagttt ctttattacg tagtactagc 
ttaggtgctg atagttcaat tgcttactct 

25 tcaattagca ttactacaga agtaatgcct 
aatatgtaca tctgcggaga ttctactgaa 
ttttgcacac aactaaatcg tgcactctca 
cgtgaagtgt tcgctcaagt caaacaaatg 
ggttttaatt tttcacaaat attacctgac 

30 gaggactfcgc tctttaataa ggtgacactc 
gaatgcctag gtgatattaa tgctagagat 
acagtgttgc cacctctgct cactgafcgat 
.agtggtactg ccactgctgg atggacattt 
gctatgcaaa tggcatatag gttcaatggc 

35 aaccaaaaac aaatcgccaa ccaatttaac 
acaacaacat caactgcatt gggcaagctg 
ttaaacacac ttgttaaaca acttagctct 
gatatccttt cgcgacttga taaagtcgag 
ggcagacttc aaagccfctca aacctatgta 

40 agggcttctg ctaatcttgc tgctactaaa 
agagttgact tfctgtggaaa gggctaccac 
ggtgttgtct tcctacatgt cacgtatgtg 
ccagcaattt gtcatgaagg caaagcatac 
ggcacttctt ggtttattac acagaggaac 

45 aatacatttg tctcaggaaa ttgtgatgtc 
gatcctctgc aacctgagct tgactcattc 
catacatcac cagatgttga tcttggcgac 
attcaaaaag aaattgaccg cctcaatgag 
gaccttcaag aattgggaaa atatgagcaa 

50 ggcttcattg ctggactaat tgccatcgtc 
agttgttgca gttgcctcaa gggtgcatgc 
gatgactctg agccagttct caagggtgtc 
<212> Type : DNA 
<211> Length : 3768 

55 SequenceName : SEQ ID 654 

SequenceDescription : 



tttagagcca ttcttacagc cttttcacct 72 0 

gcctattttg ttggctattt aaagccaact 780 

acaatcacag atgctgttga ttgttctcaa 840 

aagagctttg agattgacaa aggaatttac 900 

ggagatgttg tgagattccc taatattaca 960 

gctactaaat tcccttctgt ctatgcatgg 1020 

gattactctg tgctctacaa ctcaacattt 1080 

gccactaagt tgaatgatct ttgcttctcc 114 0 

ggagatgatg taagacaaat agcgccagga 12 0 0 

aaattgccag atgatttcat gggttgtgtc 12 60 

acttcaactg gtaattataa ttataaatat 1320 

tttgagagag acatatctaa tgtgcctttc 13 8 0 

gctcttaatt gttattggcc attaaatgat 1440 

taccaacctt acagag&tgt agtactttol;, 150.0... 

tgtggaccaa aattatccac tgaccttatt 1560 

ggactcactg gtactggtgt gttaactcct 1620 

tttggccgtg atgtttctga tttcactgat 1680 

ttagacattt caccttgcgc ttttgggggt 1740 

tcatctgaag ttgctgttct atatcaagat 1800 

catgcagatc aactcacacc agcttggcgc 1860 

actcaagcag gctgtcttat aggagctgag 1920 

cctattggag ctggcatttg tgctagttac 1980 

caaaaatcta ttgtggctta tactatgtct 204 0 

aataacacca ttgctatacc tactaacttt 2100 

gtttctatgg ctaaaacctc cgtagattgt 2160 

tgtgctaatt tgcttctcca atatggtagc 2220 

ggtattgctg ctgaacagga tcgcaacaca 22 80 

tacaaaaccc caactttgaa atattttggt 2340 

cc tctaaagc caactaagag gtcttttatt 2400 

gctgatgctg gcttcatgaa gcaatatggc 24 60 

ctcatttgtg cgcagaagtt caatggactt 2520 

atgattgctg cctacactgc tgctctagtt 2580 

ggtgctggcg ctgctcttca aatacctttt 2640 

attggagtta cccaaaatgt tctctatgag 2700 

aaggcgatta gtcaaattca agaatcactt 2760 

caagacgttg ttaaccagaa tgctcaagca 2820 

aattttggtg caatttcaag tgtgctaaat 2880 

gcggaggtac aaattgacag gttaattaca 2940 

acacaacaac taatcagggc tgctgaaatc 30 00 

atgtctgagt gtgttcttgg acaatcaaaa 3 060 

cttatgtcct tcccacaagc agccccgcat 3120 

ccatcccagg agaggaactt caccacagcg 3180 

ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ttcttttctc cacaaataat tactacagac 33 00 

gttattggca tcattaacaa cacagtttat 3360 

aaagaagagc tggacaagfca cttcaaaaat 342 0 

atttcaggca ttaacgcttc tgtcgtcaac 3480 

gtcgctaaaa atttaaatga atcactcatt 3540 

tatattaaat ggccttggta tgtttggctc 3600 

atggttacaa tcttgctttg ttgcatgact 3660 

tcttgtggtt cttgctgcaa gtttgatgag 3720 

aaattacatt acacataa 3768 



Sequence 



60 <213> OrganismName : SARS coronavirus TOR2 
<400> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga 
acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat 
tactatcctg atgaaatttt tagatcagac actctttatt taactcagga 
65 ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg 
atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt 
tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat 



ccggtgcacc 
gaggggggtt 
tttatttctt 
caaccctgtc 
tgtccgtggt 
taacaattct 



60 
120 
180 
240 
300 
360 
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actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 42 0 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 54 0 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

5 caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 72 0 

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 78 0 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900 

10 cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 102 0 

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 10 8 0 

ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 114 0 

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga .1200 

15 caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260 

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320 

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 13 80 

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 144 0 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

20 tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 162 0 

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680 

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgcgc ttttgggggt 174 0 

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800 

25 gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860 

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 192 0 

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 198 0 

catacagttt; ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2 040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

30 tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 222 0 

ttttgcacac aactaaatcg fcgcactctca ggtattgctg ctgaacagga tcgcaacaca 228 0 

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340 

ggtttrtaafct tttcacaaat attaccrtgac cctctaaagc caactaagag gtcttttatt 2400 

35 gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460 

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520 

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgotctagtt 2580 

agtggtactg ccactgctgg atggacattt ggtgcfcggcg ctgctcttca aatacctttt 2640 

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctrctatgag 2700 

40 aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760 

acaacaacat craactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 282 0 

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 288 0 

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 294 0 

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000 

45 agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3 060 

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120 

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180 

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 324 0 

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300 

50 aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3 3 60 

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 342 0 

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 348 0 

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540 

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3 600 

55 ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3 660 

agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720 

gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768 
<212> Type : DNA 
<211> Length : 3768 
60 SequenceNarae : SEQ ID 655 
SequenceDescription : 

Sequence 



65 <213> OrganismName : SARS coronavirus GD01 
<40 0> PreSequenceString : 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 
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acttttgatg atgttcaagc tcctaattac 

tactatcctg atgaaatttt tagatcagac 

ccattttatt ctaatgttac agggtttcat 

atacctttta aggatggtat ttattttgct 

5 tgggtttttg gttctaccat gaacaacaag 

actaatgttg ttatacgagc atgtaacttt 

tctaaaccca tgggtacaca gacacatact 

ttcgagtaca tatctgatgc cttttcgctt 

cacttacgag agtttgtgtt taaaaataaa 

10 caacctatag atgtagttcg tgatctacct 

aagttgcctc ttggtattaa cattacaaat 

gctcaagaca cttggggcac gtcagctgca 

acatttatgc tcaagtatga tgaaaatggt 

aatecaettg ctgaactcaa atgatctgtt 

15 cagacctcta atttcagggt tgttccctca 

aacttgtgtc cttttggaga ggtttttaat 

gagaggaaaa gaatttctaa ttgtgttgct 

ttttcaacct ttaagtgcta tggcgtttct 

aatgtctatg cagafctcttt tgtagtcaag 

20 caaactggtg ttattgctga ttataattat 

cttgcttgga atactaggaa cattgatgct 

aggtatctta gacatggcaa gcttaggccc 

tcccctgatg gcaaaccttg caccccacct 

tatggttttt acaccactac tggcattggc 

25 tatgaacttt taaatgcacc ggccacggtt 

aagaaccagt gtgtcaattt taattttaat 

tcttcaaaga gatttcaacc atttcaacaa 

tccgttcgag atcctaaaac atctgaaata 

gtaagtgtaa ttacacctgg aacaaatgct 

30 gttaactgca ctgatgtttc tacagcaatt 

atatattcta ctggaaacaa fcgtattccag 

catgtcgaca cttcttatga gtgcgacatt 

catacagttt ctttattacg tagtactagc 

ttaggtgctg atagttcaat tgcttactct 

35 tcaattagca ttactacaga agtaatgcct 

aatatgtaca tctgcggaga ttctactgaa 

ttttgcacac aactaaafccg tgcactctca 

cgtgaagtgt tcgctcaagt caaacaaatg 

ggttttaatt tttcacaaat attacctgac 

40 gaggacttgc tctttaataa ggtgacactc 

gaatgcctag gtgatattaa tgctagagat 

acagtgttgc cacctctgct cactgatgat 

agtggtactg ccactgctgg atggacattt 

gctatgcaaa tggcatatag gttcaatggc 

45 aaccaaaaac aaatcgccaa ccaatttaac 

acaacaacat caactgcatt gggcaagctg 

ttaaacacac ttgttaaaca acttagctct 

gatatccttt cgcgacttga taaagtcgag 

ggcagacttc aaagccttca aacctatgta 

50 agggcttctg ctaatcttgc tgctactaaa 

agagttgact tttgcggaaa gggctaccac 

ggtgttgtct tcctacatgt cacgtatgtg 

ccagcaattt gtcatgaagg caaagcatac 

ggcacttctt ggtttattac acagaggaac 

55 aatacatttg tctcaggaaa ttgtgatgtc 

gatcctctgc aacctgagct tgactcattc 

catacatcac cagatgttga tcttggcgac 

attcaaaaag aaattgaccg cctcaatgag 

gaccttcaag aattgggaaa atatgagcaa 

60 ggcttcattg ctggactaat fcgccatcgtc 

agttgttgca gttgcctcaa gggtgcatgc 

gatgactctg agccagttct caagggtgtc 

<212> Type : DNA 
<211> Length : 3768 

65 SequenceNarae : SEQ ID 656 

SequenceDe script ion : 



actcaacata cttcatctat gaggggggtt 12 0 

actctttatt taactcagga tttatttctt 180 

actattaatc atacgtttga caaccctgtc 240 

gccacagaga aatcaaatgt tgtccgtggt 3 00 

tcacagtcgg tgattattat taacaattct 360 

gaattgtgtg acaacccttt ctttgctgtt 42 0 

atgatattcg ataatgcatt taattgcact 480 

gatgtttcag aaaagtcagg taattttaaa 540 

gatgggtttc tctatgttta taagggctat 600 

tctggtttta acactttgaa acccattttt 660 

tttagagcca ttcttacagc ctttttacct 720 

gcctattttg ttggctattt aaagccaact 780 

acaatcacag atgctgttga ttgttctcaa 84 0 

aagagctttg agattgacaa aggaiatttac , ,.900 

agagatgttg tgagattccc taatattaca 960 

gctactaaat tcccttctgt ctatgcatgg 1020 

gattactctg tgctctacaa ctcaacattt 1080 

gccactaagt tgaatgatct ttgcttctcc 1140 

ggagatgatg taagacaaat agcgccagga 1200 

aaattgccag atgatttcat gggttgtgtc 12 60 

acttcaactg gtaattataa ttataaatat 1320 

tttgagagag acatatctaa tgtgcctttc 1380 

gctcttaatt gttattggcc attaaatgat 1440 

taccaacctt acagagttgt agtactttct 15 00 

tgtggaccaa aattatccac tgaccttatt 1560 

ggactcactg gtactggtgt gttaactcct 162 0 

tttggccgtg atgtttctga tttcactgat 1680 

ttagacattt caccttgctc ttttgggggt • 1740 

tcatctgaag ttgctgttct atatcaagat 1800 

catgcagatc aactcacacc agcttggcgc 18 60 

actcaagcag gctgtcttat aggagctgag 1920 

cctattggag ctggcatttg tgctagttac 1980 

caaaaatcta ttgtggctta tactatgtct 2040 

aataacacca ttgctatacc tactaacttt 2100 

gtttctatgg ctaaaacctc cgtagattgt 2160 

tgtgctaatt tgcttctcca atatggtagc 222 0 

ggtattgctg ctgaacagga tcgcaacaca 2280 

tacaaaaccc caactttgaa agattttggt 2340 

cctctaaagt caactaagag gtcttttatt 2400 

gctgatgctg gcttcatgaa gcaatatggc 2460 

ctcatttgtg cgcagaagtt caatggactt 2520 

atgattgctg cctacactgc tgctctagtt 2580 

ggtgctggcg ctgctcttca aatacctttt 2640 

attggagtta cccaaaatgt tctctatgag 2700 

aaggcgatta gtcaaattca agaatcactt 2 760 

caagacgttg ttaaccagaa tgctcaagca 2820 

aattttggtg caatttcaag tgtgctaaat 2880 

gcggaggtac aaattgacag gttaattaca 2940 

acacaacaac taatcagggc tgctgaaatc 3 0 00 

atgtctgagt gtgttcttgg acaatcaaaa 3 060 

cttatgtcct tcccacaagc agccccgcat 3120 

ccatcccagg agaggaactt caccacagcg 3180 

ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ttcttttctc cacaaataat tactacagac 33 00 

gttattggca tcattaacaa cacagtttat 3 3 60 

aaagaagagc tggacaagta cttcaaaaat 3420 

atttcaggca ttaacgcttc tgtcgtcaac 348 0 

gtcgctaaaa atttaaatga atcactcatt 3 540 

tatattaaat ggccttggta tgtttggctc 3 600 

atggttacaa tcttgctttg ttgcatgact 3660 

tcttgtggtt cttgctgcaa gtttgatgag 3720 

aaattacatt acacataa 3768 
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<213> OrganismName : SARS coronavirus CUHK-W1 
<4 00> PreSequenceString : 

5 atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120 

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180 

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttga caaccctgtc 240 

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 3 00 

10 tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360 

actaatgttg ttatacgagc atgfcaacttt gaattgtgtg acaacccttt ctttgctgtt 42 0 

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 48 0 

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540 

, cacttacgag agtt-tgtgtt taaaaataaa gatgggtttc fcctatgttta taagggctat 6 ; 0,0 . 

15 caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 72 0 

gctcaagaca cttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840 

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900 

20 cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960 

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 102 0 

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 10 80 

ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140 

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 12 00 

25 caaactggtg ttattgctga fctataattat aaattgccag atgatttcat: gggttgtgtc 1260 

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttafcaaatat 1320 

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 13 80 

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440 

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

30 tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560 

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620 

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680 

tccgttcgag atccrtaaaac atctgaaata ttagacattt caccttgctc ttttgggggt 1740 

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 18 00 

35 gttaactgca cfcgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 18 60 

atatattcta ctggaaacaa tgfcattccag actcaagcag gctgtcttat aggagctgag 1920 

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcafcttg tgctagttac 19 80 

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040 

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

40 tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 222 0 

ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 22 80 

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 23 40 

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 24 00 

45 gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460 

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520 

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 25 80 

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640 

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 27 00 

50 aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760 

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 28 20 

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 28 80 
gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca ' 2940 

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 30 00 

55 agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060 

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120 

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180 

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240 

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 33 00 

60 aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360 

gatcctctgc aacctgagct tgactcattc aaagaagagc tggacaagta cttcaaaaat 342 0 

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480 

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540 

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600 

65 ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660 

agttgttgca gttgccfccaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720 

gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768 
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<212> Type : DNA 

<211> Length : 3768 

SequenceName : SEQ ID 657 
SequenceDe script ion : 

Sequence 



<213> OrganismName : SARS coronavirus BJ01 
<400> PreSequenceString : 

10 atgtttattt tcttattatt tcttactctc actagtggta 
acttttgatg atgttcaagc tcctaattac actcaacata 
tactatcctg atgaaatttt tagatcagac actctttatt 
ccattttatt ctaatgttac agggtttcat actattaatc 
atacctttte aggatggtat.,.ttattttgct gccacagaga 

15 tgggtttttg gttctaccat gaacaacaag tcacagtcgg 
actaatgttg ttatacgagc atgtaacttt gaattgtgtg 
tctaaaccca tgggtacaca gacacatact atgatattcg 
ttcgagtaca tatctgatgc cttttcgctt gatgtttcag 
cacttacgag agtttgtgtt taaaaataaa gatgggtttc 

20 caacctatag atgtagttcg tgatctacct tctggtttta 
aagttgcctc ttggtattaa cattacaaat tttagagcca 
gctcaagaca cttggggcac gtcagctgca gcctattttg 
acatttatgc tcaagtatga tgaaaatggt acaatcacag 
aatccacttg ctgaa,ctcaa atgctctgtt aagagctttg 

25 cagacctcta atttcagggt tgttccctca ggagatgttg 
aacttgtgtc cttttggaga ggtttttaat gctactaaat 
gagagaaaaa aaatttctaa ttgtgttgct gattactctg 
ttttcaacct ttaagtgcta tggcgtttct gccactaagt 
aatgtctatg cagattcttt tgtagtcaag ggagatgatg 

30 caaactggtg ttattgctga ttataattat aaattgccag 
cttgcttgga atactaggaa cattgatgct acttcaactg 
aggtatctta gacatggcaa gcttaggccc tttgagagag 
tcccctgatg gcaaaccttg caccccacct gctcttaatt 
tatggttttt acaccactac tggcattggc taccaacctt 

35 tttgaacttt taaatgcacc ggccacggtt tgtggaccaa 
aagaaccagt gtgtcaattt taattttaat ggactcactg 
tcttcaaaga gatttcaacc atttcaacaa tttggccgtg 
tccgttcgag atcctaaaac atctgaaata ttagacattt 
gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag 

40 gttaactgca ctgatgtttc tacagcaatt catgcagatc 
atatattcta ctggaaacaa tgtattccag actcaagcag 
catgtcgaca cttcttatga gtgcgacatt cctattggag 
catacagttt ctttattacg tagtactagc caaaaatcta 
ttaggtgctg atagttcaat tgcttactct aataacacca 

45 tcaattagca ttactacaga agtaatgcct gtttctatgg 
aatatgtaca tctgcggaga ttctactgaa tgtgctaatt 
ttttgcacac aactaaatcg tgcactctca ggtattgctg 
cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc 



ggttttaatt tttcacaaat attacctgac cctctaaagc 
50 gaggacttgc tctttaataa ggtgacactc gctgatgctg 
gaatgcctag gtgatattaa tgctagagat ctcatttgtg 
acagtgttgc cacctctgct cactgatgat atgattgctg 
agtggtactg ccactgctgg atggacattt ggtgctggcg 
gctatgcaaa tggcatatag gttcaatggc attggagtta 
55 aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta 
acaacaacat caactgcatt gggcaagctg caagacgttg 
ttaaacacac ttgttaaaca acttagctct aattttggtg 
gatatccttt cgcgacttga taaagtcgag gcggaggtac 
ggcagacttc aaagccttca aacctatgta acacaacaac 
60 agggcttctg ctaatcttgc tgctactaaa atgtctgagt 
agagttgact tttgtggaaa gggctaccac cttatgtcct 
ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg 
ccagcaattt gtcatgaagg caaagcatac ttccctcgtg 
ggcacttctt ggtttattac acagaggaac ttcttttctc 
65 aatacatttg tctcaggaaa ttgtgatgtc gttattggca 
gatcctctgc aacctgagct tgactcattc aaagaagagc 



catacatcac cagatgttga tcttggcgac atttcaggca 



gtgaccttga 
cttcatctat 
taactcagga 
atacgtttga 
aatcaaatgt 
tgattattat 
acaacccttt 
ataatgcatt 
aaaagtcagg 
tctatgttta 
acactttgaa 
ttcttacagc 
ttggctattt 
atgctgttga 
agattgacaa 
tgagattccc 
tcccttctgt 
tgctctacaa 
tgaatgatct 
taagacaaat 
atgatttcat 
gtaattataa 
acatatctaa 
gttattggcc 
acagagttgt 
aattatccac 
gtactggtgt 
atgtttctga 
caccttgctc 
ttgctgttct 
aactcacacc 
gctgtcttat 
ctggcatttg 
ttgtggctta 
ttgctatacc 
ctaaaacctc 
tgcttctcca 
ctgaacagga 
caactttgaa 
caactaagag 
gcttcatgaa 
cgcagaagtt 
cctacactgc 
ctgctcttca 
cccaaaatgt 
gtcaaattca 
ttaaccagaa 
caatttcaag 
aaattgacag 
taatcagggc 
gtgttcttgg 
tcccacaagc 
agaggaactt 
aaggtgtttt 
cacaaataat 
tcattaacaa 
tggacaagta 
ttaacgcttc 



ccggtgcacc 
gaggggggtt 
tttatttctt 
caaccctgtc 
tgtccgtggt 
taacaattct 
ctttgctgtt 
taattgcact 
taattttaaa 
taagggctat 
acctattttt 



cttttcacct 
aaagccaact 
ttgttctcaa 
aggaatttac 
taatattaca 
ctatgcatgg 
ctcaacattt 
ttgcttctcc 
agcgccagga 
gggttgtgtc 
ttataaatat 
tgtgcctttc 
attaaatgat 
agtactttct 
tgaccttatt 
gttaactcct 
tttcactgat 
ttttgggggt ' 
atatcaagat 
agcttggcgc 
aggagctgag 
tgctagttac 
tactatgtct 
tactaacttt 
cgtagattgt 
atatggtagc 
tcgcaacaca 
atattttggt 
gtcttttatt 
gcaatatggc 
caatggactt 
tgctctagtt 
aatacctttt 
tctctatgag 
agaatcactt 
tgctcaagca 
tgtgctaaat 
gttaattaca 
tgctgaaatc 
acaatcaaaa 
agccccgcat 
caccacagcg 
tgtgtttaat 
tactacagac 
cacagtttat 
cttcaaaaat 
tgtcgtcaac 



60 
120 
180 
240 
-3,0 0 . 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
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attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540 
gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600 
ggcfctcattg ctggactaat tgccatcgtc atggttacaa tcttgctfctg ttgcatgact 3660 
agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720 
gatgacfcctg agccagttct caagggtgtc aaattacatt acacataa 3768 
<212> Type : DNA 
<21X> Length : 3768 

SequenceName : SEQ ID 658 

SequenceDescription : 



Sequence 

<213> ,Organi sreName Escherichia. 
15 ■< <40O> PreSequenceString : 

atgaacaaaa tatttaaagt tatctggaac 
gaaacggcaa aaagccgtgg caagaaatct 
gttcgcgggtg gaatgtfcgtc gtcgtttggg 
cagggtgttg attacggtag tggatcagct 
20 gcgaaagcaa atacttttat gaacaccagt 
atagctgaag gccaatatag ctctgccatc 
tcaatggcct ttggggttag tgcaatatca 
tcttcgtatt cattgggcca atactcaatg 
aaattgtcta ttgctatggg ggactcttcc 

25 ggaaatgcca ctaaagctac tgagattatg 
tcaaaagcgt attcaatggc gctgggagca 
gcgataggtg ctgagaccga agccgctgaa 
gcaaaaggga ctaatagcat ggcaatgggg 
atcg-cattag gaaatggcag ccaggctctg 

30 aacaaagctg atggcgtgga tgccatcgct 
aacaccattg ccttaggcac agccagfcaat 
agtaatagca gtgccaacgg fcattaactct 
ttacjacaata ccgtctctgt cggcaatagfc 
aatcjgcgcga tcaagtctga cragttacgat 

35 agccjactcgg tagcaaaaag gcttggagga 
gfctSLcagcac caacctacaa tttaaaaaat 
gctcjfcacttg atgaaaacac cctgcaatgg 
catcjgtacta gtagcccaac tgccagcgta 
gcctccagta aggatgcggt taacggttcc 

40 gccaacaccg ccaatatcgc tactaatacc 
gccaccaata ccaccaatat caccaacctg 
gcccrtgctct ggaacgaaac taaaaaggca 
agcaaaatca ccaacgttaa agatgccgac 
ggctctcagc tgaaaaccac caacgatgct 

45 aacacttcca atattgccac taacaccacc 
aatcttggtg aggatgcgct gaaatgggat 
ggcaccgaga ccaccagcaa aatcaccaac 
acccjatgccg ttaacggctc tcagctgaaa 
accaatatcg ccactaacac caccaacatc 

50 ggtgaggatg cgctgaaatg ggataaggac 
aataccgcca gcaaaatcac caatatcctg 
gccafctaacg gtagccagct ttatgactta 
aatgjcttctg tgaatactga cggtgtgttt 
aattattata acgtcggcga tgcactggct 

55 ggcgjatgctc tgctttggga tgccaccgca 
ggtgacgcaa gcgtgatcac tgatgtcgca 
gcagtaaacg gctcacaact ccacggcgtg 
ggtgccgaag tcaatgcaga cggcaccatc 
gattacgata atgtcggtga tgccctgaat 

60 ctctgggatg cggacgccgg tgaaaatggt 
actgccagtg taatcactaa cgtcgctaac 
attaacggct cacaactcta taccaccaat 
gcagaagtca acgctgacgg caccatcacc 
tacaacaacg tcggtgacgc cctggatgcg 

65 actg-ccaatg gcggtgctgg agcctacaat 
actaatgtcg ctaatggcag tattagtgag 
ttgaatgcga cgaatatgat gattgagcag 



coli 0157:H7 



cctgcgacag 

gggcgcagta 

gcattggcga 
ggcgacggct 
ggttccagta 
gggtcaaaaa 
gaaggcgata 
gccctcggcc 
aaagcggaag 
agtattgctc 
agtagcgtcg 
aatgcaactg 
ttcggaagcc 
gcagataatg 
ctgggtaatg 
gcaactggtg 
gtcgcgctgg 
tcattaaaac 
gccattaatg 

ggggctgcag 

ggtagcaaaa 
gaccaaacca 
atcaccgatg 
caactgaaag 
agcaacattg 
acggattccg 
ttcagtgcag 
ctgacggctg 
gtggcgacga 
aacatctcta 
aaggacaatg 
gttaaagatg 
accaccaacg 
tctaatctga 
aatggtgtct 
gacggcacag 
agcagcaata 
accggtccaa 
gcgattaact 
ggtaaattca 
gatggtgaaa 
agcagttatg 
actgcgccga 
gctatcgata 
gcatttagcg 
ggtgcaatct 
aagtacatcg 
gcaccgactt 
cttgatgata 
gccagccatg 
gacagtaccg 
aacacccaaa 



ggaattatac 
agctgttaat 
atgccgggaa 
gggttgctat 
ctgctgtggg 
cccatgcgat 
gaagtatagc 
gttattcaaa 
gagcaaacgc 
ttggcgacac 
catctgaaga 
ctattggcaa 
ttgccgataa 
caatcgccat 
gtagccagtc 
ataagagtct 
gcgcagattc 
gcaagatcgt 
gttcacagct 
tagatgttga 
ataacgtagg 
aaggcaaata 
ttgcggatgg 
ctaccaatga 
ccacgaatac 
ttggtgacct 
ctcacggcca 
acagcactga 
ataccaccaa 
acctgactga 
gtgtattcac 
gcgacctgac 
atgccgtggc 
ctgagacggt 
tcactgcagc 
tcactgcaac 
tcgccaccta 
cctacaaaat 
cctcatttag 
gtgccaaaca 
tttcagactc 
ttgttgatgc 
cgtacaccat 
ccactcttga 
ccgctcacgg 
ctgctgccag 
ctgatgcgct 
acaccattgc 
acgccctgct 
acggtaaagc 
atgcagtgaa 
ttatcaatca 



tgttaccagc 

ttctgcgctg 

tgacaacggt 

aggcaaaggg 

ttatgacgct 

tggtggtgca 

actgggtgcc 

agcattgggt 

cattgccctg 

cgccaatgcg 

aaacgctatt 

taatgcgaag 

agtcaatact 

aggccagggc 

gagaggctta 

tgcgcttggt 

cattgcggat 

taatgtgaaa, 

ttatgccatt 

tgacggtact 

ggctgcgctc 

cagcgctgct 

cacgatttca 

cgatgtcgaa 

ggcaaatatt 

tcaggctgat 

ggataccacc 

tgctgttaac 

tatcgccaat 

gacggtgact 

ggcagctcat 

gactggcagc 

gacgaatacc 

gactaatctt 

tcatggcaac 

cagttccgat 

cttcggcggc 

cggtgaaaca 

cacgtctctc 

cggtactaat 

cagttctgac 

gctggggggt 

tgctaatgct 

cgacgctctg 

aaaagataaa 

cagcgacgcg 

gggtggtgac 

gaacgccgag 

gtgggatgag 

cagcatcatc 

cggttctcag 

gctcgctggt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
:13 8 0 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
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aacaccgacg caacctatat ccaagaaaac 
gacgacggct tagcgttcaa cgacgccagc 
tataactctg tcgccaaagg cgatagcagc 
gttgatacgg gtatcgccct gggtagcagc 
5 tcccgtgaca ccagcataac ggaaaatggc 
gaactgctcg gtgcattgtc tatcggtgat 
gccgatggtt ccgaagccca tgacgccgtt 
gcggtcgcaa ccacgccgac taaatacttc 
gcagtgggaa ctgactcgct ggcaatgggt 

10' attggtatcg gttatggtgc ctacgtggac 
agcaatgcgc aagtcattca tgtcaacagt 
cgtggcgctc aaaccaatta taccgcctac 
gaattctcag tcggtagtgc ggatggtcaa 
gctgataccg atgcggtcaa cgtgggtqag 

15 aatacccaga gcattactaa cctggataat 
aatatcgaaa acggtattgg cgatatcgtc 
aataccgatg gtgtagatgc cagcgcgcag 
tccattgctg ccgctgacaa cagcgtcgct 
aatacgatct ctgtaggttc ctctactaac 

20 aaaaatgcta ccgatgctgt taacgtggca 
cgttacgaca ccaaagctga tggttctatc 
aacggcggta cgactcgtat cagcaacgtc 
aattacgcgc agttgaagca aagcgtgcag 
gttgagatgg ataacaaact gtctaaaact 

25 gcaatggcaa tgaccggtct gccgcaggct 
ggtggcggta cttacaacgg tgaatcggca 
aatggtcgtt gggtctacaa attacaaggt 
gcactcggtg ccggtattca gtggtaa 
<212> Type : DNA 

30 <211> Length : 4767 

SequenceName : SEQ ID 659 
SequenceDe script ion : 

Sequence 
35 

<213> OrganisraName : Escherichia 
<40O> PreSequenceString : 
gtgccagctt ccgcagtagg tgcgctgggc 
accgacagcg caggcaacag caattccgcc 

40 cctggcgtca ccattaaccc agttgcgacc 
aatgcgcaaa ccatcagcgg ccaggtgacg 
acgcttggcg ggaaaactta caccgctacc 
gttccggcgg cggatattca ggccatcggc 
accaacggcg ttggcaatac tggcagcggt 

45 ccaggtctgc gcgttgacac cgtggcgggc 
caggcgctgg tgataactgg tagcagcagc 
gtgattaaca cggtcactta cgctgcaaca 
gttccggcgg cagacgtgag taactggcct 
actaacacag ccggaacaac atccaccatc 

50 gtggcgattt ccattaacac cgtttccggc 
gcagatttaa ccctttccgg cagcacctcc 
acctttggcg ggaaaaccta caccgctacc 
gtacccgccg ccgatctcag cgtgttacgc 
agcactatta acggcaacac ggcttcggca 

55 ccgacgcttg ccattaacac catcgccacc 
aatccgttaa ccatcagcgg tagcagcacc 
cttaatggtg tgacttacag cggctccgtc 
ccgacggcgg atctcagcaa tctgaccgcc 
gataaagcgg gtaacccggc gtccgctaac 

60 gtgctgacca tcaacaccgt ctccggcgat 
gcgctggtga tctccggctc cagcactggc 
ctaaacagta aaacctacac caccatgctg 
ccggcggctg acgtcactgc gcttggcagc 
gacgcggcag gcaacagcga tgacgccagc 

65 accattggta tcaacaccat cgccaccgat 
gacctgcaaa tcaccggcac cagtaatcag 
aacgggcaaa attacaccgc tactaccgat 



291/341 

ggtgcgggta ttaactatgt gcgtactaac 3180 

gcacagggtg ttggcgctac agctataggt 3240 

gtagctattg gtcagggcag ctacagcgac 3300 

tctgtttcca gccgagtgat tgccaaaggc 33 60 

gttgttattg gttacgacac cacggatggc 3420 

gacggtaaat atcgtcaaat catcaacgta 348 0 

acggttcgtc aattgcagaa tgcgattggt 3 540 

cacgctaatt caacggaaga agattcactg 3 600 

gcgaaaacca tcgtgaatgg cgataaaggt 3660 

gcgaatgcac ttaacggcat tgccattggt 3720 

attgcgatag gtaatggttc tacgaccact 3780 

aacatggacg caccgcagaa ctctgtcggt 3 840 

cgtcagatca ctaacgtcgc agcaggttcg 3900 

ttgaaagtaa cggatgcgca ggtttcccag 3960 

cgggtaacga atcttgattc acgcgtcacc 4020 

accaccggta gcaccaagta cttcaagacc 4080 

ggtaaagata gcgtcgcgat tggttccggc 4140 

ctgggtacag ggtctgtggc aaccgaagaa 4200 

caacgtcgta tcaccaacgt agctgcaggt 4260 

cagttgaagt cttccgaagc tggcggtgta 4320 

gactatagca atatcaccct cggtggcggc 43 8 0 

tccgctggcg tcaacaacaa cgacgtggtg 4440 

gaaacgaagc aatacaccga tcagcgaatg 4500 

gaaagcaagt tgagcggtgg tatcgcttct 4560 

tacactccag gtgccagcat ggcctctatt 462 0 

gttgctttag gtgtatcgat ggtgagcgcc 4680 

agtaccaata gccagggtga atactccgcc 4740 

4767 



COli 0157:H7 

gaagccagct acacggtgac ggcgaacgtc ■ 60 

agccataacg tgcaggtcaa taccgcgctg 120 

gacgatatta ttaacgccgc cgaatcgggc 180 

ggggcggcgg cgggcgatac ggttaccgta 240 

gtgcagggga atttaagctg gagcgtggac 3 00 

aatggcaatc tgacggttaa cgcttcggtg 360 

tcgcgagata ttactatcga cgccaacctg 420 

gatgatgtgg tcaatagcat cgagcacgct 480 

gggctggcgg cgggcgcggc gctgacggtg 540 

gtattagccg atggcacatg gagcgttggt 600 

gcgggtacgg tgaatatcac ggtaagtggc 660 

acccatccgg tcaccgtcga tctggcggcg 720 

gacgatgtga ttaacgccgc cgaaaaaggg 780 

ggcgtggaag tggggcaaac ggtcaccgtt 840 

gtagcgggcg atggtagctg gacaaccacc 900 

gacggcgacg ccaccgtgca ggccagcgtc 960 

acccacgcct acagcgtcga tgccacggcc 1020 

gacgatattc tgaacgctgc cgaggcgggc 1080 

gccgaagcgg ggcagacggt aaccgtcacg 1140 

caggcggacg gcagctggag cgtcagctta 1200 

agccagtaca ccgttagtgc ctcggtaagc 1260 

cacgggctgg cggtggatct caccgtgccg 1320 

gacattatta acgccgccga acacggacag 13 80 

ggcgaagcgg gtgatgtcat caccgtcaca 1440 

gacgcttccg gcaactggag cgtcggcgtt 1500 

ggcccgcaaa ctatcactgc ggcaattacc 1560 

cgcacggtca ccgtgaatct cgccgcgcca 1620 

gacgtgatta aagccacgga aaaaggcgcg 1680 

cctgcgggca ccaccattac ggtgacgctg 1740 

agcaacggca actggagcgc cacggtgcca 18 00 
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gcgtcagcgg 
acggcacjgca 
gttaccatta 
caaaccatca 
5 ggcggcaaca 
gcggcggata 
ggcgtccjgca 
ctgcgggtcg 
cfcggtgatca 
10 aacaacgtta 
gcggtagacg 
agtgcccjgaa 
atcaccatta 
ctggtccjtaa 
15 ggcggcaaaa 
cctgccgatc 
attaaccggca 
gtgaccatca 
atcaccatca 
20 aacaacacgt 
gcggatfctaa 
gcgggtaacc 
accattaaca 
gtggtcagcg 
25 ggtaaaaact 
gcggatcftca 
gcgggcaaca 
accattaaca 
accatcagcg 
30 ggcggcaatc 
gcgaaccjttt 
tccgcaaata 
ctgcgcgtcg 
ttggtgafcca 
35 aatagcgtca 
gtggcagatg 
accgcccfcyaa 
atcaccatca 
ttgcagcrtct 
40 ggcggcaaaa 
gccgtccjatg 
gtcgcaggga 
gttaccatca 
ctgaccatca 
45 ggcgttaatt 
ggcgatfcfcag 
gccagaaatt 
accatcaaca 
atcatcagcg 
50 acgaccacct 
agcgtgattt 
gcaggcaaca 
gccattaaca 
gcgattaccg 
55 caaaattaca 
cgggtgagcg 
ggcaacagcg 
attaacgtgg 
atcagcgggc 
60 gcgacttaca 
gcgctacagg 
ggcaatactg 
gtcgacaccg 
attaccgcjca 
65 caaacctatg 
gatgtcagcg 
ggaaatccgg 



ttagcgcatt 
acagtaattc 
acgcggtggc 
gcgggcaggt 
cctacaccgc 
ttcaggcgct 
acaccggcag 
ataccgtggc 
ccggcagcag 
cttatggcgc 
tcagcaactg 
caacctccac 
acactctctc 
gcggcagcac 
attacaccac 
tcgctgcgct 
acagcgccca 
acaccatcgc 
gcggcactac 
accagaccac 
gcggattaac 
cggcaagcgc 
ccgtcgcggg 
gcaccagtac 
acaccaccac 
cggcgctggc 
gcgacagcac 
ccgtctccgg 
gtcaggtcac 
aatataacgc 
tgcaggcgct 
ataccggcac 
ataccgtggc 
ccggtagcag 
cctacggcgc 
tcacaaactg 
ccacaaccag 
acaccctttc 
ccggcaccac 
gctacaccac 
ttgcaacctt 
acagcaccca 
acaccatcgc 
gcggcaccag 
acagcggcaa 
ctagccttac 
cggcttcggc 
cggtggcggg 
gctcggcaac 
ataccaccgt 
ccgcgctggc 
gcggcacggc 
ccattgccgt 
gcaccagcaa 
ccaccactgc 
ccctcggtga 
gttccgccag 
tggcaacgga 
aggtgacggg 
ctgccacggt 
agttgggcaa 
gtaacggcac 
tggcgggcga 
gcagctccgg 
ttgcggcggt 
cctggcctgc 
taagcgttac 



gggtgaagcc 
cgccagtcat 
gactgacgat 
gacgggcgca 
cacggtgcag 
gggcaacggc 
cggctcgcgc 
gggcgatgat 
cagcggatta 
gacggtatta 
gccagcgggt 
cattacccat 
tggtgatgat 
cagcggtgtc 
cacagtggaa 
accggacggc 
ggccgatcgc 
cagcgacgat 
cacggcgcag 
cgttctggcg 
cgccagcagt 
cgaccacgcg 
cgatgacatt 
gggcgcggcg 
gctggatgcc 
gaccggtagc 
gactcacgat 
cgatgacatt 
gggaacggcg 
caccgtgcag 
gggtaacggt 
cgcgacgcac 
aggcgatgat 
tagcggactg 
aacggtttta 
gcctgcgggg 
cattagccat 
cactgacgat 
ctccggcgtg 
cacggttgcc 
gccagacggc 
ggcaacgcat 
cacggacgat 
cacggcggaa 
tgtgcaggca 
cgccagctcg 
aacgcataat 
cgatgacatc 
gggcgcgact 
gctggacgcc 
gcagggcgat 
ctcgcacact 
cgatgacatc 
ccagcctgcg 
cgatgcttcc 
agccacctac 
ccataacgta 
cgatattatt 
tgcggcggca 
gcaggcgaat 
cggcgaactg 
gcgcgaaatc 
tgatgtggtt 
tctggcggcg 
gctggcggat 
gggatcggtg 
gcatccggtg 



aactacacgg 
aatgtgctgg 
attattaacg 
gcgcaggggg 
tctaatttaa 
gacctgacgg 
gatatcacga 
gttattaaca 
acggcgggaa 
gccgacggca 
acggtgaata 
ccggttaccg 
gtgatcaacg 
gaagccgggc 
gctaacggta 
gcgggcaacg 
gcgtatagcg 
atccttaacg 
gccgggcaga 
gatggcacct 
tacaccgtga 
ctggtggtag 
atcaacgcca 
gcgggtgatg 
tccggtaact 
cagaccatca 
gtgaccgttg 
atcaacgccg 
gttgccggga 
tcagatttaa 
gaactgacca 
gatatcgtga 
gtgattaaca 
gcggcgggcg 
gcggatggta 
acggtcaata 
ccggtcacgg 
gtgattaacg 
gaagcggggc 
gcggataata 
gcagcaaacg 
gcttacagcg 
attcttaacg 
gccgggcaga 
gacgggagct 
tataccgtta 
ctgacggtgg 
attaacgcca 
accggtaata 
aacggcaact 
gtgaccatta 
gtcaccgtgg 
atcaacgccg 
ggcacgcaga 
ggtaactgga 
acggtgaccg 
caggttaata 
aacgccgccg 
ggcgacacgg 
ttaagctgga 
accatttcgg 
accatagacg 
aatattatcg 
ggcagcaacg 
ggcacctgga 
acgattgcgg 
acggtcgatc 



taacggcaaa 
tcaacagcgc 
ctgccgaatc 
atacggttac 
gctggagtgt 
ttaatgcctc 
tcgacgccaa 
gcattgagca 
cggcgttaac 
cgtggagcct 
tcacggtaag 
tcgatctggc 
ccgtcgaaaa 
agacggtgac 
gctggacggt 
tgcaggcgag 
ttgatgccac 
tgagcgaagc 
cgctcaccgt 
ggagcgtgaa 
ccgccacggt 
atatcactgc 
tcgaacatgg 
tggtaaccgt 
ggagcgtggg 
ccgccagcct 
atcttagcgg 
ctgaaatcgt 
atacggtgat 
gctggagcgt 
tcagcgcctc 
tagatgccaa 
gcatcgagca 
cggcgttgac 
gctggagcgt 
ttgccgtctc 
tcgatctcgc 
ccgccgagaa 
aaaccatcac 
cctgggggct 
tccaggccag 
ttgatgccac 
ccgccgaagc 
cggtgaccgt 
ggagcgtcag 
acgcctcggt 
accttgccgc 
cggaacacgg 
cggtttccgt 
ggagcatcgg 
ccgctacggt 
cgctcggcgc 
cggagaaagg 
ttaccgttac 
gcgtgaccgt 
cagccgccac 
ccgcgctgcc 
aagcgggcgt 
tgaccgtcac 
gcgtcgatgt 
cttcggtgac 
cgaatctccc 
agcacgggca 
tcacgctgac 
gcgtcggcgt 
cgagcggtag 
tctcggcggt 



cgtcactgat 
cttgcctgcc 
gggaaatgcg 
cgttacgctg 
ggacgttccg 
ggtcaccaat 
tctgcctggc 
caatcaggcg 
ggtcgagatt 
cggtgttccg 
cggcaccaac 

tggggtcgcc 

aggcgaaacg 
cgtcaccttt 
gaatgtgccg 
tgtcagtaat 
cgcgccgctt 
tggcgcgggg 
cacgctcaat 
tgttccggca 
gagcgacaaa 
gccggatctc 
tcaggcgttg 
cacgctgaac 
cattccggcg 
gagcgatcgc 
cccgacgctg 
tgtggcgcag 
tgtcaccatt 
cagcgtaccg 
gttgaccaat 
cctgccaggt 
cactcaggcg 
ggtggttatc 
tggtgttccg 
tggcaccaac 
tgccgtggcg 
aggctctgat 
cgttatcttc 
gacgatccct 
cgtcagcaat 
tgcgccctcc 
gggatcggcg 
aacgctaaac 
cgtaccaacc 
cagcgacaaa 
tccggtcgtc 
acaggcgcag 
gacgattggc 
cgtgcctgcc 
caccgactcc 
tccggtgctc 
cgcggatctg 
gctcaacggg 
tccggcgtca 
tgacgccgat 
gggcgtcacc 
ggaacagacc 
gctcggcggg 
tccggcctcc 
gaacagcgta 
cggtctgcgg 
ggcgctggtg 
cattaacggg 
tccggcggtg 
cacctctgcc 
ggcggtgagc 
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atcaacgcca ttaccgccga tgatgtgatc 
ctctccggca gcacctctgg cgttgaagcc 
aaaacttaca gcgccacggt ggctgcgaat 
gatatggcgg ctctgcgtga tggcgatgcc 
5 ggcaacagcg ccaccacgac ccacgcttac 
attaatacca ttgcgggcga tgatattctt 
atcaccggca gcagcacggc ggaagcgggg 
aactacaccg gcaccgtaca gacggacggc 
ttaagcaccc tgaccgccag caactacacc 
10 aacccggcct cggttaatca caacctgacg 
aacacggtgg caggcgatga tgtgatcaac 
agcggctccg ccactggagc ggcaaccggt 
acctttacca cggtgctgga tgccagcggc 
gtctcggcac tggcgaatgg, cacggtgacc 
15 aacagcggca gcgctaccca tcaggtgacg 
aacgcoatca gcggcgataa catcctgaac 
agcggcggca gtacggggct ggcgacgggc 
aactacagcg ccaccaccga cgcatcgggc 
ctggcggcat taggtcaggc caactatacg 

20 aacaccgcca gcagccaggc gaatttactg 
aacaccgtgg caggcgacga tattatcaac 
agcggggtgg tgactcgcgc cgccgctggc 
acttacaccg ctacggtaca gagcaactta 
ctccaggcgt tgggcaatgg tgatttgacc 

25 aacaccggga gcggcacgcg ggatatcacc 
gataccgtgg cgggcgatga tatcgtcaac 
accggcggca gtagcggcct gaatgcaggt 
gcgtattccg ccacggtgca ggcggacgga 
gtcagcgcct ggcctgcggg gccgttaacc 

30 aacccagtca gcgtcagcca tccgttcacc 
aacaccgttg ccagcgacga cgtgattaac 
tccggcagta ccagcgggat tgagagcggg 
acctacactg caagcgtcgc cgccaacggg 
ctggcaactc tgccagaggg cgcggcgaat 

35 aacagtgcct cggcgaccca tgcgtatagc 
aacaccatcg ccagcgacga tatccttaac 
agcggcacca gcaccgccga • aaccgggcag 
tacaccggca ctgtgcaggc ggacggtagc 
ggcgccjctca acgcaagcaa ttacaccgtc 

40 cccggcagcg ccagccataa tctggcggta 
accgtgjgcgg gcgatgacat catcaacgat 
ggcaccagta gcggcgggga agcgggcgat 
tacaccacca ccctggatgc ctccggcaac 
acggccjctgg gtagcggtgc gcagaccatc 

45 agcgacgacg ccagccgcac cgtgaccgtc 
accatcgcgg gcgatgatgt gatcaacgcg 
ggcaccagcg atcagcctgc gggtacggcg 
agcgccacca cggatgcctc cggcaactgg 
gcgctgjggcg aagcgaccta cagcgtgacg 

50 agcaccgcca gccataacgt gcaggttaat 
gtggcgacgg acgatattat caacgcttca 
caggtaaccg gtgcggcggc gggcagcacc 
accgccaccg tccaggctga tttaagctgg 
gctctgggta acggtgagtt aacggttaat 

55 ggtagcggca cgcgcgatat caccatcgat 
gtcgcgggtg atgatgtggt caatatcatc 
agcagctccg gctttgccgc aggcacggcg 
gccgccacgg tgttggcgaa cggtagctgg 
aactggcctg cgggaacgct gaatattacc 

60 accagcatta cccatccgct gactgttgat 
atcaccagcg atgatgcgat taacgccgcc 
agcacgtccg gcgtcgaagc ggggcaaacc 
accaccacgg tggcggcaaa cggtagctgg 
gccctcjcgtg atggcgatgc cagcgcccag 

65 gccacagcaa cgcacgaata cagcgtcgat 
attgccagcg ataacatcat caacgccagc 
accagcaccg cgcaaaccgg gcagacgctc 



aacgctgccg aaaaaggcgc ggcgttaacg 588 0 

ggacaaacgg ttaccgtcac ctttggcggc 5940 

ggttcctgga gcacctcggt tccggcggca 6000 

agcgcacagg ccagcgtcag caatgttaac 6060 

agcgttgatg ccagcgcgcc aacggtgacc 6120 

aacgccgccg aagccggagc ggctctgacc 6180 

cagacggtga ccgtcacgct caatggcaca 6240 

agctggagcg tcagcgtacc gtcagccgac 63 00 

gtgaacgcgg cggtgagcga caaagccgga 63 60 

gtggatacgt ccgttccggt cgtcaccatc 642 0 

gcgacggaac acgcccaggc gcagatcatc 64 80 

agcaccgtga cggtgactat cggcacaaat 6540 

aactggagcg tcggcgttcc ggcaagcgtc 6600 

atcaatgcca gcgtcaccga tgccggagga. 6S60 

gtcaataccg ggctgccgac cattaccttt 6720 

gccgatgaaa aaggccagcc gttgaccatc 6780 

gcgcaggtca ccgtcacgct caacggtcac 684 0 

aactggacct taaccgtgcc ggtgagcgat 6900 

gtcagcgcca gcgccaccag tgcagcaggc 69 60 

gtcgacagcg gcctgccgga cgtcaccatc 7020 

gccgccgaag cgggggccga tcaaaccatc 708 0 

gatacggtca ccgtgacgct gggcgggaac 7140 

agctggagcg tcagcgttcc gacagccgat 7200 

attaccgcct cggtcaccaa cgctaatggc 7260 

attgatgcca acctgccggg gctgcgcgta 7320 

agcatcgagc acgggcaggc gctggtgatc 73 80 

gctgtgctga cggttaccat caacagtgtg 7440 

agctggagcg ttggcattcc ggcggcaaac 7500 

gtggaggtag acgggcaaag cagcgccaat 7560 

gtcgatttaa cggcggtggc aatcagcatc 7620 

gccgcagaaa aaggcaccaa tctgactctt 7680 

caaaccgtca ccgtcacttt tggcggtaaa 7740 

agctggagtg taaacgttcc ggcggcagat 7800 

gtgcaggcca gcgttagcag cgcgagcggt 7860 

gttgacgcca gcgcgccgac gctcaccatt 7920 

gccgcagaag ccggaagccc gctcacc'atc 7980 

acggtgaccg tcacccttaa cggcgcaacc 8040 

tggagcgtca gcgttcccac ttcagccctg 810 0 

agcgccacgg tcaatgacaa agcgggcaac 8160 

gacaccaccg cgccggttct caccattaac 8220 

gccgaacatg cgcaggcgct ggtgatctcc 8280 

gtggtgagcg ttgtgctcaa cggcaaaacc 8340 

tggagcgttg gcgttccggc ggcagatgtt 8400 

accgccagcg tcagcgatcg ggcaggcaac 8460 

agcctcagcg cgccagtgat tagcatcaac 8 52 0 

acggaaaaag gatctgatct ggcgctttct 8580 

atcaccgtca ccctgaacgg acaaaactac 864 0 

agtgttaccg tgcctgcctc ggcggtcagt 8700 

gcgagcgtca ccaatgctca gggtaacagc 8760 

accgcgctgc cgggcatcac cattaatccg 8820 

gaggcaggca gcgcgcaaac tatcagcggc 8 880 

gtcaccgttg aactgggcgg taaaacttac 894 0 

aatgtgagcg tacctgccgc cgactggcag 90 00 

gcctcggtga ccaacgccgt tggcaacacc 9 060 

gccagtctgc ctggcctgcg ggtggatacc 9120 

gaacacgctc aggcgcaggt gatcaccggc 9180 

ttgacggtgg tgattaacaa ccaaacttac 9240 

agcgtcggcg ttcctgcgac ggatgtcagt 93 00 

gttagcgggg cgaacagtgc cggaacgcaa 93 60 

ctcaccgccg tcgccatcag catgaacagc 9420 

gaaaaaggcg cggcgttaac gctctccggc 948 0 

gtcaccgtca cctttggcgg caaaacctac 9540 

agcaccaccg ttccggcggc ggatttggcg 96 00 

gtgcgggtga ctaacgtcaa cggcaacagc 9660 

agcgccgcgc caacggtgac catcaacacc 9720 

gaagcggcgg cgggcgtaac ggtgtccggc 9780 

accgtcacgc ttaacggcac taactaccag 9840 
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accacggtgc agacagacgg cagctggagt 
cttgctaata acggctacac cctgaccgcc 
agcgccagca aaggcgtgac cgtcgatacc 
gcgggcgatg atgtgattaa caacgtcgaa 
5 gccacgggcg cggtggcggg cgaccgcctg 
accagcaccg atgccagcgg caactggagc 
cttgcggatg gcacggtgac catcagcgcc 
acgcagacgc acaacgtaca ggtgaacacg 
agcggcgata accttattaa cgccgccgaa 
10 ggcactaatt tcgcgacagg tacagtggtg 
gccaccattc agagcaacgg gagctggagc 
ctcagtgatg gcaccagcta cacggttagc 
aacagcagca cgcagacgca caacgtacag 
agcactatca, gcggcgataa ccttattaac 
15 agcggcactg gcactaattt cgcgacaggt 
ggctacagcg ccaccattca gagcaacggg 
gtfcgcggcac tcagtgatgg caccagctac 
ggaaacagcg ccacggcctc gcgcagcgtg 
attaacaccg tttcgacgga cgaccgcctc 
20 cttaacggct cgaccagcgc ggaagtgggg 
acctataccg ccacggttgc cgccaatggt 
ctgjgccgcgc tcgggcaggg cgcgcagacc 
aaccccggac aggccacgca cgccctgacc 
gccacggtgg cgggtgacga tattatcaac 
25 agcggcacca ccaccgccga agtgggccag 
tggagcgcaa cggttggcag cggcggaagc 
gccggattaa gcgacggcag ctacaccatt 
cccggcagcg ccagccgtgg cgtgacgctg 
acctttgcgg gcgacgatgt ggtgaatgca 
30 ggcaccacca ccgcgcccgt cgggcagacg 
accaccacgg tgcagacggg cggtagctgg 
gcgctggcgg acggcaacgc ctacgtgatt 
. , accggtagca gtaatcacac. cattaccgtc 
atcgattccc tgcaagccga cactggcctt 
35 ccgcjtagtgg tcaacggctc gctcaccgcc 
agtatcgacg gcggcacaac ctggaccacg 
aacgacagcc gcacgctgac cgatggcaac 
gcgggcaacg ttggcgcgac cgacagccag 
cccgcggtga aaaccatcgc catcagcgcg 
40 gafcfcttgtca ccagcgacac gacgcttgcg 
gccggtgagt tcgcgcaaat cagcctcgac 
gttggcacca gctggagcta tgccgatggt 
acggtgcggg tggtggatct ggcggggaac 
gtcgacacca ccagcccgga agcggcgaaa 
45 accggaacca gcagcagcga ttttattacc 
ttgcjgcgcgg cgctcggcgc taatgagttc 
tgggtgaacg tgaccgtcgc cgcagacagc 
ctcaccaacg gcaccaccac ctggcaggtg 
gcaacgagca gccagtcggc gctgatcgat 
50 gccagcatca gcaccgacac ggggagttcg 
ctcacgctga ccggttcgct gggggcgggg 
cttgatagcg gcgcgacctg gacaacgctc 
gacagccgca cgctgaccga cggcagctac 
gggaacaccg gcccggtggt gtcgaaaacg 
55 acaccaacga ttgtgtcgta taccgatgat 
tcgcaggcca ccgacgacac tacgccgctg 
agcggtgaag tggtttacct ctaccgtaac 
ggcgctctga actggaccta cagcgacagc 
gcgcgagtgg tggatttggc ggggaatatc 
60 gatacctcta ttccgaccac gctggcgcaa 
ccgattatta gcggggtgat caccgccgcg 
atcaacggca aaacctacac ctctgaaccg 
aaca.cctggt atgtacagtt gccggatacc 
accgttactg cgcaggtaaa aagttcggcg 
65 ggcacggtga cggttaacgc ggcgattgat 
accaccgcct gggggctgac ctacggcctc 
aaccagcagg taatgcaatc gactgaccca 



ttaacgctgc ccgccagcga cttaaccgca 9900 

acggtcagcg atctggcggg taaccttggc 9960 

actgcgccgg tgatcagttt taacaccgtg 10020 

cacattcagg cgcagattat cagcggcacc 10080 

gtggtgacca tcgccgggca gcagtatgtc 10140 

gtcggcgtgc ctgccagcgt gatttccggc 102 00 

accattaccg atagcgcagg caacagcagc 10260 

gcggcagtgt cgctctcggt cagcactatc 10320 

gcgggcagtg cgctgaccct gagcggcact 103 80 

accgtgttgc ttaacggcaa aggctacagc 10440 

gtgaacgtgc ctgcggcgga tgttgcggca 10500 

gcctccgctc aggacagtgc cggaaacggc 10560 

gtgaacacgg cggcagtgtc gctctcggtc 10620 
gccgccgaag cgggcagtgc gctgaccctg- , 1063G 

acagtggtga ccgtgttgct taacggcaaa 10740 

agctggagcg tgaacgtgcc tgcggcggat 10800 

acggttagcg cctccgctca ggacagtgcc 10860 

gcggtggatc tcaccgcgcc ggttatcagc 1092 0 

aacgccgccg aacagcagca gccgttaacc 1098 0 

cagacggtca ccgtcacctt tggcggcaaa 11040 

acctgggcgc tgaacgtgcc tgcggtggat 11100 

attaccgcca gcgtgaacga tcgcgccggt 11160 

gtcgataccg ttgcgccaac ggtcaccatc 1122 0 

aacgccgagc agcttgccgg gcagaccatt 1128 0 

acggtgaccg ttacctttaa cgggcaaacc 11340 

tggtcggtgt ttattccggc gcagcagttt 11400 

agcgcgacgg ttagcgatca ggccggaaac 11460 

aacggcgatg tacctactgt caccatcaac 11520 

gcggaacacg gctcatcgct ggtgatcagc 11580 

ctgaccctga ccttaaacgg caaaacctac 11640 

agctacaccc tcggcagcgc cgatgtcacc 11700 

aacgcctcgg tgagcaatgc cattggcaac 11760 

gatctcagcg ctccggcgat gggcattaat 11820 

agcgccagcg actttatcac cagcgtcagt 11880 

gcgcttgcca gtaacgagac ggcgcaaata 11940 

cttaccgtta ccggcacgac ctggcgctat 12000 

tatctctatc aggtgcgggt gattgacgca 12060 

aatgtggtga tcgacactac cgcgccagat 12120 

atcaccaccg atatgggctt gatcactaac 12180 

gtgagcggca cgctgggggc gacgctttct 12240 

ggcggcgtca cctggactac gttaaccgtc 123 00 

cacacgctta ctgacggcac ctggaactac 123 60 

gttgggcaga ccgcaacgca aaacgtggtg 12420 

agtatcacca ttaccggtat cagcgatgac 12480 

agcgacacca cgcttaccgt gcgcggcgta 12540 

gcgcaaatca gtaccgacaa cggcgcaacc 1260 0 

ctgaactgga gttacgttga cggacgaacc 12660 

cgggtggtcg atctggcggg caacgttggc 12720 

accgttaacc cggcgcaggt gctcaccatc 12780 

gcaactgact ttatcaccag cgacaccatg 12840 

cttgccagcg gcgaagtggc gcagattagc 12900 

accaccaacg gtacacagtg gacttacacc 12960 

gtttatcagg tgcgggtgct ggatctggcg 13020 

gtggtggtcg atacgattaa ccccaccgcc 13080 

gtcgggcagc ggcaggggac attaagcagt 13140 

ctgaacggtg tactttccgc gccgcttgcc 13200 

gggctgctgt taggggcggt gacgatggtc 13260 

gggctggtga gcggtgccta tacctacagc 13320 

acctcctcca gtgattttgt cctgacggtc 133 80 

atcaccagcc agaccacgcg cgatactacg 13440 

cttgccagtg ggcagtatgt tgaagtggtg 13500 

ggcggcgcgg tagtggtcga tccggcgcac 13560 

gatgcgctga cagtttccgc gaccgcctat 13620 

ggtaacggca ataacgccaa tattagcaac 13680 

tacacaccga cctggactac cgccagcaaa 13740 

gactcgcacg ggatgtggac ggtgctggca 13800 

ctcacctggt cgaagaccgc gctgacgctg 13860 
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tatcagagcg gcaacaacta cgccaccagc tccattgccg attacgaccg taacggcacg 
ggcgatctgt ttatcacccg tgatgactac ggtacgggct atattaacgg ctttaccaat 
aacggcgatg gcaccttttc cagcgctatt caggtcaccg tcggcaccct gacgtggtac 
ggctcgattg tggcatfctga taaagagggc gacggctatc tcgacttctg gattggtgac 
5 gctggcgggc cggactccaa caccttcctg tggaacaacg caggcacgct ggtaggcaac 
tccaccacgt cgaacagcgg cggtagcgcc acggtgggcg gggcggtgac ggggtatctt 
tcgctcaacg aaggttctgg cgtcgatctg aacaatgacg gcaggatcga cctggttcag 
cacacctata acctgaacaa ctattacacg ctgtcttcgc tcatcaacca ggggaatggg 
acgtttgtct gggggcsgaa caccaccaat accttcctga gcggggcggg cagtggcgct 

10 atgagcagca gcgtttccat gacctgggcc gatttcgatg gtgacggcga tatggatctc 
ttcctgcccg ccagccaggg aagagctaac tacggctcgc tgttattcaa caccaacggc 
gtactgggtt gcccggtggc ggtgggcgca acggcaacca cctacgccag ccagtttagc 
ctggcggtgg actggaacca cgacggcctg atggatatcg cccgtatcgc ccagaccggg 
, , cagtcgtatc tttatactaa cgtcagcaac. gccagcaact .ggacgcaatc ggccctcggc 

15 ggcagccaga gcggtaccac cagcggcgtg gcggcaatgg actacgactg ggacggcgcg 
gtggatgtgc tggtgtccaa acagtcgggc agcgtgttcc tgagccgcaa caccaacacg 
gtgagctacg gcacttcgct acacctgcgc atcaccgatc ccaacggcat taacgtctat 
tacggcaata ccgtgaagct gtacaactcg gcgggagtgc tggtcgccac gcaaatcatc 
aacccgcagt cgggtafcggg ggttaacgac acctcggcgc tggtcaactt ctacgggctg 

20 aatgccggag aaacctacaa cgcggtgctg atcaaatcca ccggcaccac cgccagcaat 
atcgaccaga cggtcaacac cagctggggc ggtttacagg ccaccgatgc cactcacgct 
tacgatctca gcgctgaagc gggtaccgcg agcaacaacg gcaagttcgt cggcaccggc 
tataacgaca ccttctfccgc caccgcaggc accgacactt acgacggttc cggcggctgg 
gtgtacagct ccggcaccgg aacgtggctg gcgaacggcg ggatggacgt ggttgatttc 

25 cggctttcga cggtggg-cgt gacggctaac ttaagcagca ctgccgcgca ggccaccggt 
tttaacacct cgacatttac caatatcgaa ggcatttccg gctcgaattt taacgacatt 
ctgaccggca gcagcggcga taaccaactg gaagggcgcg gcggcaacga cacgctcaac 
atcggcaacg gcggccacga caccttgctc tataaactgc tcaacgccag cgacgccact 
ggcggcaacg gctcagacgt ggtgaatggc tttacggtag ggacatggga aggcaccgcc 

30 gacaccgatc gcattgatat tcgtgaactg ttacagggca gcggctacac cggcaacggc 
aaagccagct acgtcaacgg cgtggcaacg ctggatgcgc aggccggaaa catcggtgac 
tttgtcaaag tcacccagag cggcagcgac accatcgtgc agatcgaccg cgacggcacg 
ggcggcactt ttgcgacaac taacgtggtc acgctgacgg gcgtgcacaa cgacctcgcc 
accttgctgg cgaatcatca gttgatggtg gtgtag 

35 <212> Type : DNA 

<211> Length : 158 76 

SequenceName: : SEQ ID 660 
SequenceDescrription : 

40 Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

ttgggtgttc atacagccga agccacactg cctaacggca acaacgatac gaagatagtc 60 

45 aatattgcgc ccgatgccag caacgcgcag gttacgctga acatccctgc tcaacaggtg 12 0 

gtgacgaata acagcgacag cgtgcagctg acggcgacgg tgaaagatcc gtcgaatcat 180 

ccggtggcgg gaataacggt gaacttcacc atgccacagg acgtggcggc aaactttacc 240 

ctcgaaaata acggtatfcgc catcacccag gccaatgggg aagcgcatgt cacgctcaaa 3 00 

ggtaaaaaag cgggtaccjca tacggttacc gcaacgctga gtaataacaa taccagtgat 3 60 

50 tcacagccgg taacgtttgt ggcggacaaa acctcggctc tggttgttct tcagatatca 420 

aaaaatgaga tcacaggtaa tggcgtcgat agcgcaacgc taactgcaac ggtcaaagat 480 

cagttcgaca atgaggtgaa caatcttccg gtaacattca gcacagcttc ttcaggcctc 540 

accctgaccc caggggaaag taataccaat gagtctggca tcgcgcaggc cactctcgca 600 

ggcgttgcct ttggtgagca gacggtcact gcatcactgg ctaataatgg tgccagcgac 660 

55 aacaaaactg tgcattttat tggcgacaca gcggcggcaa aaattatcga gttgacgcct 720 

gtcccagaca gcataatcgc aggtaccccg cagaacagct ccggcagcgt catcaccgcc 780 

acagtcgttg ataataatgg ctttccggtg aaaggtgtga ctgtgaactt caccagcaac 840 

gcagcgacag ccgaaatgac gaatggcggt caagccgtga cgaacgaaca gggtaaggct 900 

accgtcactt ataccaatac ccgctcctcg atagaatcag gagcgagacc ggataccgtt 960 

60 gaggccagtc tggaaaatgg tagctccacg cttagcacat caattaatgt caacgctgat 1020 

gcgtctacgg cacatctcac cttgctacag gcactttttg atacagtctc cgcaggcgac 1080 

actaccaatc tgtatattga ggtgaaggat aattacggca acggagtacc ccagcaggag 1140 

gtaaccctca gcgtttcvacc aagtgaaggt gtgaccccca gtaataacgc tatatatacg 12 0 0 

accaatcacg acggcaatfct ttacgcaagc tttaccgcta caaaagccgg ggtataccaa 1260 

65 gtgacggcaa ccctcgaaaa tggcgattcg atgcaacaaa cagtgaccta tgtgccgaac 1320 

gtagcgaatg ctgaaatctc gctggcagcc tcgaaggatc cggtaattgc caacaataac 1380 

gatctcacga cactaacagc aacagtcgct gatacagagg gcaatgcgat agccaacagt 1440 



13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
.14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15876 
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gaggtaacat ttactctgcc ggaagatgtg agggcgaact 
aaagtggtta ctgatactga aggcaaagcg aaagtcacgc 
gctcatactg ttacagcatc gatggctggc ggtaagagtg 
attgcggata cactcactgc gcaggttaat cttaacgtta 
5 aataacgtcg ggatgaccag gctgcaggca acagtgactg 
gccaatgagg cggtgacatt cacgctaccg gcagatgtga 
caaggcggtt ccgccattac tgacatcaac ggcaaggctg 
aaatccggca cctaccccgt gacagttagc gtgaacaatt 
caggtgactt tgattgccga tgctggtacc gcaaaactag 
10 tcattcgtcg tcagcacgac cgagggcgcg accatgactg 
ggcaacccgg tagaaggtat aaaagttaat ttccgcggaa 
accagcgttg aaacggatga tcggggtttc gctgaaattc 
ggactgaaaa cagtttcagc ctctctggca gataaaccta 
ctgaatgcaa aagcaga.tat taafcfcctgca acgattacca 

15 caggtcatgg tcgcacaaga cgtagcagtt aaagctcacg 
ccgattctta atgaatctgt aacattcagt gcagaaccac 
caaaatattg tctctactga tacgcatggt atagccgagg 
aacggttcgt atatggtgaa agcatccctg gcgaatggat 
gtggtaatcg atcaaaaact gacactctcg gcgtccagcc 

20 ccaacaggtg caactctgac ggcaacgcta acttctgcaa 
caggtcatca actttagcgt aacgccagaa ggtgcgacgt 
accaactctt caggtcaggc tccagtcgtt ctgaccagca 
gtgactgcat cgttccataa cggcgtaaca atacagacac 
ggcaactcaa gcaccgccca tgttgctagc tttatcgctg 

25 accaacagtg atttaagtac cttaaaggca acggttgagg 
gaaggtctca ctgtgtactt cgccttaaaa agcggctctg 
gcggtgacag atcaaaacgg aatcgcgaca acaagcgtga 
gtcacggtaa gcgcagtcac gaccgctggt ggaatgcaaa 
gcaggcccgg cagacgcctc gcagtccgtc cttaagaaca 

30 gactttaccg atagtgctga gctacatctt gttctgcacg 
aaagtttctg aagggctgga atttgtgcag tcaggtacca 
agtgcaattg actacagtaa aaatttctca ggcgagtaca 
ggagagggta fccgcaacgct gatccctgta ttgaatggtg 
accacaatac aattcactcg cgcagaagac aaaataatga 

35 ggtgctaacc taccgacaac tacattccct tcgcaggggt 
ttgaataatg acaactttgc cccaggaaaa acggcggctg 
gcctcctggg ttgatgttga tgctaccggt aaagtgacat 
tgggagagga ttacggcgac gccaaaaaca ggcggcccta 
gtgaagagtt ggtgggtgaa cgccggcgat gctttcatga 

40 ttttgcagta gcaatggcta cacacttccc cttggagacc 
cgaggcatcg ggtcactgta cagtgaatgg ggagatatgg 
ggttttcatt caaatatgta ttggtcatcg agtcccgcaa 
gtttccctgg caacaggtga tcaaagcgta tttgaaaagc 
tgttataaaa acctctga 

45 <212> Type : DNA 

<211> Length : 4038 

SequenceName : SEQ ID 661 
SequenceDescription : 



tcacgctggg 
tgaaaggtac 
agcagttggt 
ccgaggacaa 
atggaaacgg 
gcgcaagctt 
aagttacact 
atggtgtcag 
cctccttaac 
caagcgtcac 
cttccgtcac 
ttgtgacaag 
ctgaagtcat 
gtctggagat 
tcaacgacca 
cagagcacat 
tcactatgac 
cctcttatga 
cgcttatcgg 
atggcactcc 
taagtggcgg 
ataaagtcgg 
agacaatcgt 
atccatcgac 
atggcagtgg 
ccacattaac 
gaggagcgat 
cagtagatat 
atcggtcatc 
atatatcagg 
acgcgcccta 
aagccactgt 
ttcatcaagc 
gcggtacagt 
tcactggggc 
attatgagtt 
ttaaaaatgt 
gctatatata 
tatacagcct 
atttaaacca 
ggcafctacac 
actcaaacga 
ttgggtttgc 



cgatggcggt 
aaaagcaggc 
ggtgaacttt 
ttttatcgct 
caacccgtta 
tactctcgga 
gagcggtaca 
tgatacgaaa 
ctctgtatac 
tgacgctaac 
gctaagcagc 
caccgaggtc 
ctcgcgatta 
acctgaaggt 
gtttggcaat 
gaccatcagc 
gcccgaaaga 
gaaggatctg 
tgtcaattcc 
agtggagggt 
aaaagtgaga 
tacatatacg 
gaaagtcact 
tatagccgcc 
taacctgatc 
gtcattaaca 
aacggggagc 
aacgctggtg 
attgaaagga 
caatccgatc 
tgtgcaagtt 
tacaggcggc 
gggtctgagt 
gfcfcagtcaat 
gtattatcag 
ttcaagctct 
cggcagcaaa 
cgaaatccga 
tgctgaaaat 
tagtcgttcc 
gactgaagct 
acaatacgta 
ttatgcgaca 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
22,80 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4038 



50 



Sequence 



55 



60 



65 



<213> OrganismName : Escherichia coli 0157 :H7 
<400> PreSequenceString : 

atgagcctga ttattgatgt tatttcgcgt aaaacatccg tcaaacaaac gctgattaat 60 

cctggcgacg tcacggttgt tatttatgag ccttccgtgg tgcaggttca tgctcaggcc 120 

tctgccgttg cgcgttacgt ccgtgaagga aatgacctgc tgatctatat gcaggacggc 180 

acggtgatcc gctgcaacgg ttatttcctg caagcggcga atacagctga acaatcggaa 240 

ctggtgtttg ccgatggtca acagctaacc catatcacct ttgccgatac tgctgcgggt 3 00 

ccgtagaact gactgcccag accactgcga ttgaaagcat tgcgccattt 3 60 

ttgctcagac cagcgccttc ccgtggggtt ggctggcggg ggcggcggta 42 0 

ggtggtggcg cgcttggtgc actgctggca agcggtggcg atggcgactc gaaaacagaa 480 

gfcgattaata accctacgcc acctgctgag cctggcaacg ccacaccatc atttttagtt 540 

accgataatc agggcgatca gcgcggcatt ctagccacca atgacatcac cgatgacacc 600 

acgccaacct ttagcggcag cgggcaggcg ggggcgacta ttcagattaa agacagtaac 660 

ggcaatacta ttgccagtac tcaggtagac aacaacggtc actggagtgt ctcgttaccc 720 

acgcaaagtg caggtgaaca tacctggtca gtggtgcaaa ttgtcggcag taccatcact 78 0 

gacgccggtt cgataacgtt aaccatcgac aatagtcagg ccagcgtgca ggttgccacc 84 0 



ggattagccc 
cttgataccg 
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accgcaggcg ataacattat taacgccagc 
accagtagcc atctggcgca gggaacagaa 
acgaccagcg taggcgctaa cggtcgcctgg 
gcgttaggcg aagggaatca ggcgcjtgctg 
5 gtcaccggcg cgcagctact aacgcjtcgat 
atcgctcagg acaacattat cagtgjctgcg 
acgtcgaatg cagaagcggg gcaaaccgta 
gcaaccgtcg gtagcgacgg aacctggcaa 
ctggcggagg gtaattacgc tgtcaatgcc 

10 agccacagcg cgaatttcac ggtagacacc 
gcgggcgacg atattcttaa taatcjccgag 
gtcagcggtg cttctccagg cgatacggta 
ggcatcgtgc tggcagatgg cagctggaat 
ctggatcgcg gagccaatac gattfctcgtc 

15 gcggcgtctc gagcaatcac gctgcjtcggt 
tccggcgatg acattatcag tggcgcagaa 
actcaacagg ctgagacagg acaaaccgtc 
actaccgtgc aggccgatgg ctcctggagt 
ctgcctgacg gcgcggtggc gattaccgct 

20 aacacttccc gcaccattac cgtcgatagc 
accgctgata acatcattaa cgccgccgaa 
accgacgctc agccggggca gacgcgfcgacc 
gtcgtgcagc cagacggcac ctggagcgtg 
gctgacggca acgctacggt caccgccagc 

25 gtttcacgcg tggcgctggt ggatcgccacg 
accgataacg tcatcaacac gccggaacat 
actggcgctc aggcgggcga tatcgtcacc 
gtggtggatg gttccggcaa ctggagtctg 
gcggacggca gttatcctgt cagcgtctcg 

30 cagtcattga ccgtcacggt caataccgcc 
ggcgatgatg tgattaacgc cagcgaaaaa 
gatcagcctg ttaacaccgc catcaccgtg 
accgacgcct ccggcaactg gagcgtcacc 
caggccaact atacggtaac ggcgcjcggtg 

35 agccataacg tgctggtcga cagcgcgctg 
gacgatafcta ttaacgccgc cgaacjcgggc 
ggcgcggaag atggcgacac ggtaactatt 
gtgggcagca atctcacctg gagccjtggac 
aatggcgatt taacggttaa tgcctcagtc 

40 acgcgggata tcaccatcga cgccraatctg 
gatgatgtgg tcaatatcat cgagoacggg 
gggctggctg aaagcacgcc gcttaccgtt 
gtgcaggccg atggtagctg gagccgfcgggc 
gcggggacgg ttaatattgc cgtttcaggg 

45 acgcatccgg tgacggtgga tctcactccg 
gacgatgtga ttaacgccgc agaaaaaggc 
aacgtagaac ccggtcaaac cgtcaccgtc 
gtagcgagcg atggtagctg gactgccacc 
gagggcagcg cctccgcact ggccagcgtc 

50 gtgcacaact acagcgtcga cagcagcgcg 
gacaatatcg tcaacgccag cgaagccgat 
gccgaagcgg ggcagattgt tacgataacg 
gcaacggtgc aggcggacgg cagctggagc 
ttgaccgatg gcagccacac cctgaccgcc 

55 agcaccacgc ataatctggc ggtg^atctc 
gcgggcgatg acattattaa cgccaccgaa 
agcaccggcg gagaagcggg ggatgtcgtc 
accaccctgg acgcctccgg caactrggagc 
cttggcagcg gcccgcaaac tgtcaccgcc 

60 aattag 

<212> Type : DNA 
<211> Length : 4386 

SequenceName : SEQ TJD 662 
SequenceDe script ion : 

65 



gaacaggccg ccgggtttac gctttctggc 90 0 

ctcaccgtta cgctaaacgg caaaacctac 960 

agcgtgcagg tgccgaccgc cgatgcacag 1020 

gtcagtggga aagacgccac aggcaatacg 1080 

acccaaccgc caacgcttgc catcaacacc 1140 

gaacataacg tcgcgctggt actgagcggc 120 a 

acactgaccg tcaacgggaa aagccataca 12 60 

gtgacgctgc ctgccacgga agtccaggca 1320 

agtgtcagcg atcgggcagg gaacaccacc 13 80 

tcagcacccg tggtcagtgt taataccgtg 1440 

caggccgtcg cgcagatcat ctccggacaa 150 0 

acggtgaaat tgggcactca tgtcctgacg 15 60 

gtggcgctgg acccagcggt aacccgcacg 1620 

accgtgacag atgctgcagg, aaatactggc A 68 . 0 

gtttctccgt tgatcaccat taacaccgtc t, 1740 

aaaggtgcgc cactgaccct taccggtagc 18 0 0 

acagtaaccc tggctggaca gagttttacc 18 60 

ctgacggtac ctgccgccgc gatgggaaat 1920 

tctgtgacgg atctcagcgg caataccggc 1980 

caggccccgg ccttaagcat tgatccactg 2 040 

agcgggcagg atctgcccat caccggcacc 2100 

gttacgttaa atgggcagac gtatcagggc 2160 

actgtgcccg ccgccaacgt gggcgcactg 222 0 

gtgaacgatg tcgccggtaa tccgagcagc 22 8 0 

ccgccggtgg taaccattaa tccggtggcg 23 40 

gctcaggcgc aaatcatcag cggcacggtt 24 0 0 

gtgacgctga ataatgtgga ttacaccacg 2460 

ggcgttccgg cctcggtggt cagtgggctg 252 0 

gtaaccgaca aagccggaaa cacgggcagc 2580 

gcgcccctta tcggcattaa cagcattgcg 2640 

ggggccgatc tccagattac cggcaccagc 270 0 

acgctgaacg ggcaaaatta caccaccacg 2760 

gttccggcat cggcggttac agcattaggc 282 0 

accagcgata tcggcaacag cgccactgcc 2880 

cccggtgtga ccattaatcc ggtggcaacc 294 0 

gtggcgcaaa ccatcagcgg gcaggtgact 3 000 

acgttgggtg gtaatactta tacggcgacg 3060 

gttccagcgg cagatattca ggcgctggga 3120 

accaatcaaa acggcaacac cggcagcggc 3180 

cccggcctgc gggtcgatac ggtggcgggc 3240 

caggcgctgg tggtcaccgg cagcagctcg 33 00 

acgattaata atgtggaata caccactgcg 3 360 

gtcacggcgg cgcaggttag cgcctggcct 3420 

gaaagtagcg ccggaaactc ggtgagcatt 3480 

gcagcgatca ccatcaacac catcgccacg 3 540 

gctgatttaa ccctttccgg caccaccact 3 600 

acctttggcg ggaaaaatta cactgccagc 3660 

gtacccgccg ccgatctggc gtcattaccc 3 72 0 

agcaatatca acggcaatag cgcctcggcg 3780 

ccaaccatca ttatcaatac cgtcgccagc 3 840 

gcgggcgtga cggtgagcgg cagtaccacc 3900 

cttaacagcc cgaccgtgca gacctatcag 3960 

atcaatattc cggcggcaga tcttgaggca 4020 

acggtcaatg acaaagcggg caatccggcg 408 0 

accgttccgg tgctgaccat caacaccatt 4140 

cacgggcagg cgctggtgat ttccggttcc 4200 

accgtcacgc taaacagtaa aacctacacc 42 60 

gtcggcgttc cggcggcgga tgtcacggcg 4320 

acggttaccg atgcggcagg caacagcgac 43 80 

4386 



Sequence 
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<213> Organi smName : Escherichia 
<400> PreSequenceStrincj- : 
atgaatagaa tctatcgcgt gjatatggaat 
gaattaactc gcagggtagg taaaacatca 
5 acaacgaaat tcagtagatt gacgctgggt 
ggtgcgagtc tggaagttga fcaatggtcag 
gatgcctacc tggttggctg gtatggcact 
gcctccttaa ccactattac taccagcgtc 
gttaatgttt tgggtggcac ctggcgattg 
10 aatgtgggtc aatccggaac cjgggacgctg 
ggctatttaa gattaggtac fccaagctgca 
gactcggttt tgacgaccga attattcgaa 
attacggata agggttacgt cacgagttca 
— "agtaatggta aggttgtcgt ..fcgaaaagggt 
15 tcaattgaat ttcaaattgg taatcaagga 
ctgattacgg ctgaaaatac cjattatcggt 
gtgcaggatc aagactctgt catcacggta 
ggcgcagtca atatttccaa taatggacta 
gttcaggacg gttcccacgg tgtcgtcaac 

20 ggaacgggcg aagctttccg ctatatctat 
gtctcgcgtg aaggtaaagt agattcggga 
acaggcaacc ttactgttaa ggataagaac 
ggttatgacg gccacggcga aatgaatatc 
ggaagctctc tcggttatgg agaaaccggc 

25 atatgggagg tcaataagaa tgtctatacc 
aatattagcg atggcggtaa cjttcgtatcg 
agcggtatcg gcacactgaa cctgatggat 
aatgtcggta attttggtag cggtatcgta 
acgggctatg gatttatcgg aggaaatgcc 

30 gatagtctct ggaatttaaa gacgtcttct 
ttaggcacgg gtgaactgaa tatfcaccacc 
afcagctctca atgacaaaag fcaagggcgac 
gaaacattca atatgtacgfc agggacatct: 
ggcacgctga atgtcgaagg tggagaagtt 

35 acgctaaaca ttggtgcagc fccacggtgag 
gcgacgaaag tagagtttgg ctctggcgaa 
agtgatgccg gctaccaggfc cgafcatgctg ^ 
atccatgatg caggccatac ggtgttcaat 
gtcaatgacg gcctcctgac crafctgcgtct 

40 tcgagtgaag taaccattgc aagccccggt 
gcaggagatt acacgctgac craatgcgctc 
tcatcctccg acaagatgtt fcggctttaca 
gcccaactga aagacagtac cttcactctg 
gcgatgttgc agtctgacat tgaaaatacc 

45 ggtggactgg ccatgaatgg cggtacgctc 
cttgcagagg gatatatcag cgtcgatacg 
aaaggccgta actatcaggt aaacgggacg 
tggaatgatc ctatggcgaa taaccctctg 
aaccatgtcg gcgttcaact gjgtgaaggcg 

50 acgttacgtg atttacaggg cgacgaggtg 
aacggaacgg tggtcgccga gggtgattat 
ggtttgtacg ttaactatgg gctgaaagcg 
ttagccgaac atggcggagc ctatggcgca 
gaaggggatc tggcaatcaa fcacggtgcga 

55 tatcaggggg caacctacgt tcagatgggg 
ggcaacaccc gggaactgaa catcagcaac 
cagacggtag agacattcac cgggcagatg 
ctgacggtaa ataaaggtgg gatcagtcag 
gttacagggg gaacgctggc tgtcgagggg 

60 gttagcccaa atgcggaagt cagcctcgat 
gccaatgacg gtctgttaac gctaaaaaac 
gggaagggta tcgtgagcgc aaccgccagg 
cgctttgtgg ggcaattcaa cattgataca 
aacctgggtg atgcttccgt tatcaataat 

65 tgggcgatga cgcacagtat cagcggtagc 
ctgactctta acaacgattc ctcggcgtat 
attgctttcg gttccgactc tgccattaat 



298/341 



coli OX57:H7 

tgcactctac aggtatttca ggccfcgctcg 60 

acggttaatt tgcgaaagtc ctctggactg 12 0 

gttttgctgg cactaagcgg ttcagtgtct 180 

attaccaata ttgatactga tgttgcttat 24 0 

ggagtgctta atattttggc tggcggtaat 300 

attggcggta atgaggattc ggagggtacc 360 

tatgatagcg gaaataatgc aaggccttta 420 

aatattaaac agaagggtca cgtcgatgga 480 

ggcgtcggga cggttaatgt tgagggagag 540 

atagggagtt acggtacggg ctcattaaat 600 

atagtcgcca ttttaggata tcaagcgaac 660 

ggcgagtggc taataaaaaa taatgattcc 720.. 

actggggagg cgactattcg cgagggtggg 780 

ggcaatgcca ccggtgtcgg aaccctgaat 840 

cgcagactct ataatggata tttcggtaat 900 

attaataaca aagaatatfcc attggtgggc 960 

gtgaccgata aagggcattg gaatttcctc 1020 

atcggtgatg ctggcgacgg tgaacttaat 1080 

attatcactg cggggatgaa agaaacaggc 1140 

tccgttatca ctaatctcgg aactaatctt 1200 

agtaatgagg ggcttgttgt cagcaacgga 1260 

gtcgggaagg ttagcatcac cacggggggg 1320 

accattggtg ttgcgggcgt cggaaacctc 13 80 

caaaatatta cttttttggg cgataaagca 1440 

gcgacatcgt cgttcgatac tgtgggtatc 1500 

aatgtcagta atggtgccac ccttaattca 1560 

tccggtaagg gaatagttaa tatttcaacg 162 0 

actaacgccc aattgctaca ggtcggtgfca 168 0 

g9 a ggtatag ttaaagcgcg tgatacacag 174 0 

gtgagggtgg atgggcagaa ctctcttctt 1800 

ggtacgggta cgttaaccct gacgaatagc 1860 

tacttaggtg tttttgaacc tgctgtagga 1920 

gcggcggcag atgccggatt tatcaccaac 1980 

ggtgtttttg tctttaatca tactaataac 2040 

afctacaggtg acgataaaga cggaaaagtg 2100 

gcagggaata cttatagcgg taaaacgctg 2160 

catacggcag atggggtaac gggcatgggg 222 0 

acgctcgaca ttctcgcatc aacgaacagt 2280 

aaaggcgatg gcttgatgcg agttcagctg 2340 

catgcaacag ggactgaatt cgccggtgtt 2400 

gaacgcgaca acaccgctgc gcttactcac 2460 

acatcggtaa acgtgggaga gcaatccatt 2520 

attttcgata cggatattcc tgctgcgacg 2580 

ctggttgtcg gcgcgagtga ctacacctgg 2640 

ggcgacgtgc tfcatcggcgt gcctaaaccg 2700 

acgacgctca atttgctgga acacgatgat 2760 

caaacggtta ttgggtcggg tggctcatta 2820 

gaagcggaca aaacgttaca cattgcgcaa 288 0 

ggattccgcc tcacgaccgc accaggtgat 2940 

ctgaacatcc atggagggca aaagctgacg 3 00 0 

acggccgata tgtcggcaaa aatcggtggt 3060 

caggtttcgc tttccaacgg tcagaacgac 3120 

acattacgta ccgatgcgga tggcgcgctg 3180 

gcagccatcg tcgatcttaa tggatcgacg 3240 

ggttcgactg ttttgttcaa agaagggtcg 33 00 

ggtgaactga caggtggcgg aaacctgaat 33 60 

cttaatgcac gctacaatgc gttaaccagc 342 0 

aatacgcagg ggttaggcag aggaaatatt 34 80 

gtgactggcg aactgcgtaa tagcataagc 3540 

acagatgtag agctggatgg cgataatagc 3600 

ggcagcgcgc tcagcgtcaa cgagcagaaa 3660 

ggcctgctca ccatctccac tgagcgtagc 3720 

ggtgatttga caaaactggg taccgggatc 3780 

cagggtacga cggatatcgt ggggggggaa 3840 

acggcaagtc aacacattaa tatccataac 3900 
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agcggtgtga tgtcgggaaa tgtcaccact gcaggtgatg tgaacgttat gtctgggggg 3960 

acactgcgtg tcgctaaaac cacaatcggc gaatcggcgg caacctggag aatggcggca 402 0 

cggttcaaat ga 4032 
<212> Type : DNA 
5 <211> Length : 4032 

SequenceName : SEQ ID 663 

SequenceDescription : 

Sequence 

10 

<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString- : 

atgggcatca aacaacacaa tgggaatacc aaagccgatc gtctcgctga attaaaaatc 60 

c^cgtt.qgccct caattcaact, gataaaattt ggcgctattg.gtttgaafcgc aattctcttt 120. 

15 tcccccctgc tgatagctgc tgatacagga agtcaatatg gcaccaatat tactattaat )180 

gatggtgaca gaattactgg agataccgcc gatccatcag gaaacctcta tggtgtaatg 240 

accccagcag gaaacacgcc tggcaatatc aacctgggta atgatgtcac cgtcaatgtc 3 00 

aacgacgcct ctggatatgc aaaaggaatc attattcagg gcaaaaacag ctccctgaca 3 60 

gctaaccgac tcacagtaga tgttgttggt caaacctctg ccatcggcat taatttaatt 420 

20 ggtgactata cccatgctga cttaggcaca ggcagcacca ttaagagtaa cgatgacggc 480 

atcattattg ggcatagctc aacactaaca gccactcaat tcaccattga aaactcgaac 540 

ggtataggcc taaccatcaa tgactatggc accagtgtcg atcttggaag cggaagtaaa 600 

atcaagaccg atggaagtac aggtgtttat atcggtggtc tcaacggcaa taacgccaat 660 

ggtgctgcgc gttttacggc gacagacctg acaatcgatg ttcagggcta cagcgccatg 72 0 

25 gggataaacg tacagaaaaa ctctgttgtc gatctcggaa caaacagttc cattaaaacc 780 

agtggcgata atgcacacgg cctctggagc tttggccagg ttagcgcgaa tgcactcact 840 

gttgatgtaa ctggagccgc ggccaatggc gtcgaagttc gtggtggtac aaccactatc 900 

ggtgcagata gccatatttc ttccgcgcag ggcggtggtc tcgtcaccag tggttcagac 960 

gcgacaatca atttttctgg cacggcagcg caacgaaaca gcatcttttc cggcggttct 1020 

30 tatggtgcct cggcccagac ggcaacggct gttatcaaca tgcaaaatac cgatattacg 1080 

gttgatcgta atggcagtct ggcgctgggt ttgtgggcgc tcagcggcgg tagaataacc 1140 

ggagacagtt tggctatcac cggcgcggca ggagccaggg ggatttatgc catgaccaac 1200 

agccagatrcg acctcacgag cgatctggtc attgafcatga gtacacccga ccagatggcc 1260 

atcgcaacgc aacatgacga tggttatgcc gccagccgca tcaacgcctc gggtcgtatg 1320 

35 cttatcaacg gtagcgttct tfcccaaaggt gggctaatca atctggatat gcaccctggg 1380 

tcggtttgga caggttcctc cctcagcgat aatgtcaatg gcgggaaact ggacgttgca 1440 

atgaataaca gcgtctggaa cgtaacaagt aattctaatc tcgacacgct. ggcgctgagc 1500 

cattcaactg tcgattttgc cagccacggg tcaactgccg gcacatttac cacattaaac 1560 

gtagagaacc tgagcggtaa eagtaccttt attatgcgtg ctgatgttgfc tggcgagggt 1620 

40 aatggcgtta agccctgggc ctga 1644 
<212> Type : DNA 
<211> Length : 1644 

SequenceName : SEQ ID 664 
SequenceDescription : 

45 

Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

50 gtggggatcg acagccgtaa tgratattcct gaggggattg cgacgctggg cgcttttatg 60 

ggttattccc attcacatat cggttttgat cgtggaggac atggcagtgt ggacagttat 120 

tctctgggcg gctatgccag ttgggaacat gaaagtggtt tctatctgga cggtgtcgtg 180 

aagctgaacc gttttgaaag taacgtagcc ggtaaaatga gcagcggtgg agccgccaat 240 

ggcagttacc atagcaacgg gctgggcggt cacattgaaa ccgggatgcg atttaccgat 3 00 

55 ggtaactgga acctgacgcc gtatgcctcg ttaacggggt tcaccgctga taaccccgaa 3 60 

tatcatttat ccaatggcat ggaatcgaaa tcagtcgata cccgcagtat atatcgtgaa 420 

ctgggtgcaa cgctgagtta caacatgcgt ctggggaacg gtatggaagt tgagccgtgg 480 

ctgaaggcgg ctgtgcgcaa agaatttgtc gatgataacc gggtgaaagt gaatagtgac 540 

ggtaatttcg tcaatgattt gtcgggcaga cgtggaatat accaggcagg tattaaagcc 600 

60 tcattcagca gtacgttaag cgggcatctt ggggtggggt atagcaacgg tgctggtatg 660 

gaatccccgt ggaacgcggt ggctggtgtg aactggtcgt tctga 705 
<212> Type : DNA 
<211> Length : 705 

SequenceName : SEQ ID 665 

65 SequenceDescription : 

Sequence 
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<213> OrganisniName : Escherichia coli OlL57:H7 
<400> PreSequenceString : 

atgaaaaaaa aggttctggc aatagctctg gtaacggtgt ttaccggcat gggtgtggcg 60 
caggctgctg acgtaacagc tcaggctgta gcgacctggt cagcaacagc caaaaaagac 120 
accaccagta agctggttgt gacgccactc ggtagcctgg cgttccagta tgccgaaggc 180 
attaaaggtt ttaactcaca gaaaggtcta tttgacgtgg ctatcgaggg tgactcaacg 240 
gctaccgcct ttaaactgac ctcacgtctt atcaccaaca cattaaccca gttggatacc 3 00 

tcaggttcca cactgaatgt gggcgtggat tataacgcfcg cggcagtcga aaaaactggc 3 60 

gataccgtga tgatcgatac cgccaacggc gtactgg-gcg gcaaccttag cccgctggca 420 
aacggttaca atgccagcaa tcgtaccacc gcacagga.tg gtttcacttt ctccatcatc 480 
agcggcacca ccaatggtac caccgcagta accgattaca gcactctacc ggaaggcatc 540 
tggagcggcg acgttagcgt acagttcgac gcgacctgga ccagttaa 588 
<212r> Type : DNA ... 
<211> Length : 588 *, 

SequenceName : SEQ ID 666 

SequenceDe script ion : 

Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

atgacggcag aatcctacga tgataactac ctggatgatg aagacgcgga ctggaccgcg 60 

accgggcagg ggcagaaatc tgcaggtgat accagtttta cgctggcctg gaaaccggga 120 

gaggaaggcc agaaagggct tataggctgg tttgaaagcg gcgatgtccg ggcctataaa 180 

atccgttttc cgaatggcac ggtggatgtg tttcgtggct gggtcagcag tatcggtaag 240 

gccgtgacgg cgaaagaagt gatcacccgc acggtgaaag tcactaacgt gggtaaacct 3 00 

tctgtagcgg aagaacgcag caaaattacg ccggtcagtg cgattaaggt gacgccgaca 3 60 

tccggtacgg tggcaaaagg gaaaacaacc accctgacgg tttcttttga gccggaaagt 420 

gcaacagaca agacgttcag agcggtttcc gccgatccgt cgaaagccac cattagtgtg 480 

aaagatatga caattacggt aaacggcgtg gcgacaggta aggtgcagat ccctgtggtg 540 

agcggaaatg gtcagttcgc cgcagtggct gaagtcaccg ttactgaagc gggcgctgca 600 

gggtaa > 606 
<212> Type : DNA 
<211> Length : 606 

SequenceName r SEQ XD 667 

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

atgacggcag aatcctacga tgataactac ctggatga.tg aagacgcgga ctggaccgcg 60 

accgggcagg ggcagaaatc tgcaggtgat accagtttta cgctggcctg gaaaccggga 120 

gaggaaggcc agaaagggct tataggctgg tttgaaagcg gcgatgtccg ggcctataaa 180 

atccgttttc cgaatggcac ggtggatgtg tttcgtggct gggtcagcag tatcggtaag 240 

gccgtgacgg cgaaagaagt gatcacccgc acggtgaaag tcactaacgt gggtaaacct 3 00 

tctgtagcgg aagaacgcag caaaattacg ccggtcagtg cgattaaggt gacgccgaca 3 60 

tccggtacgg tggcaaaagg gaaaacaacc accctgacgg tttcttttga gccggaaagt 420 

gcaacagaca agacgttcag agcggtttcc gccgatccgt cgaaagccac cattagtgtg 480 

aaagatatga caattacggt aaacggcgtg gcgacaggta aggtgcagat ccctgtggtg 540 

agcggaaatg gtcagttcgc cgcagtggct gaagtcaccg ttactgaagc gggcgctgca 600 

gggtaa ~ ~ 606 

<212> Type : DNA 
<211> Length : 606 

SequenceName : SEQ ID 668 

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli 0157:H7 
<400> PreSequenceString : 

atgttatata atataccttg tcgaatttat atcctttcca ctctgtcatt atgcatttct 60 

gggatagttt ctactgcaac cgcaacttct tcagaaacaa aaatcagcaa cgaagagacg 12 0 

ctcgtcgtga ccacgaatcg ttcggcaagc aacctttggg aaagcccggc gactatacag 180 

gttattgacc aacaaacatt gcagaactcc accaatgcct ccatagccga taatttgcag 24 0 

gacatccccg gagtagagat aacagacaac tccttggcag gccgtaaaca aatccgcatt 3 00 
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cgtggcgaag catcctcccg tgttttaatt ctcattgatg gtcaggaggt aacttatcag 3 60 

cgcgccggag ataattatgg tgtgggactg fctgatagatg agtcfcgcgct ggagcgtgtt 420 

gaggtagtga aaggtccata ttccgtactg tacggttcac aggcaattgg cggtattgtt 480 

aacttcatca ccaaaaaggg aggtgacaaa cttgcatctg gagttgtgaa agctgtttat 540 

aattccgcaa cagcaggctg ggaagaatca atcgcggtcc aggggagcat cggtggattt 60 0 

gattatcgca tcaacggtag ttattctgat cagggcaatc gtgatacgcc ggatggacgt 660 

ctgccgaata ccaactatcg taacaatagt cagggtgtat ggttgggtta taactccgga 720 

aaccatcgtt ttggcctctc gcttgatcgc tacagactcg cgacacaaac ttactatgag 780 

gatccagacg gaagttatga ggcatttagt gtcaaaatac ctaaacttga acgagagaaa 840 

gttggggtat tctatgacac agacgtggac ggtgactatc taaaaaaaat tcatttcgac 900 

gcgtatgagc agaccatcca gcgccaattt gccaacgaag taaaaacgac acagcctgtt 960 

cccagtccga tgattcaggc tctgaccgtt cataacaaga ctgacaccca tgataagcaa 1020 

tacactcagg cggtcacatt gcagagtcac ttttcgctgc ctgctaataa tgaacttgtt 1080 

accggtgcac agtacaaaca agatagggtc agccaaaggt ccggfcggcat gacctcaagc 1140, 

aaatctctga ccggcttcat taataaggaa acacgaactc gctcctatta tgagtcagag 12 00 

caaagtacag tctcactatt cgcacaaaat gactggcaat tcgccgatca ctggacatgg 1260 

acaatgggag ttcgccaata ctggctttct tcaaagttga cgcgtggtga cggagtatca 1320 

tataccgcag gcattataag cgatacctct cttgccagag agtctgcgag tgatcacgaa 1380 

atggtaacat ctacaagcct gcgctattca ggtttcgata acctggagtt acgcgctgcg 1440 

ttcgcgcaag gctacgtatt tcccacactc tcccagcttt ttatgcagac atctgcgggc 150 0 

ggcagtgtca catacggaaa tcctgatctt aaggctgaac actccaataa ctttgaatta 1560 

ggtgcacgat ataatggtaa tcagtggctg attgacagcg cagtttacta ctcagaagct 1620 

aaagattata ttgcaagtct gatctgtgat ggcagtatag tttgcaatgg taacaccaac 168 0 

tcctcccgta gtagctacta ttattatgac aatattgatc gggcaaaaac atggggactg 1740 

gaaataagcg cggaatataa tggctgggtt fctctcgccat atatcagtgg caatttaatt 1800 

cgtcggcaat atgaaacttc aacattaaaa acaactaata caggtgaacc agcgataaac 1860 

ggacgtatag ggctgaaaca tactcttgtg atgggtcagg ccaacataat ctctgatgtt 192 0 

tttattcgtg ctgcctctag tgcaaaagat gacagtaacg gtaccgaaac aaatgttccg 1980 

ggctgggcca ctctcaactt tgcagtaaat acagaattcg gtaacgagga tcagtaccgg 2040 

attaacctgg cactcaataa cctgacagac aaacgctacc gtacagcaca tgaaactatt 2100 

cctgcagcag gttttaatgc agctataggt tttgtatgga atttctga 2148 
<212> Type : DNA 
<211> Length, t 2148 

SequenceName : SEQ ID 669 

SequenceDescription r 

Sequence 



<213> OrganismMTame : Escherichia coli 0157 :H7 
<400> PreSequenceString r 

atgacaaaaa tgagtcgcta cgccttgatt accgcgctgg cgatgtttct cgccgggtgt 60 

gtggggcaac gtgaacctgc accggtagaa gaagtgaaac cagcgccgga acaaccagcc 120 

gagccacaac agcctgtccc cacagtgccc tcggtgccga cgatcccgca gcagccaggc 180 

ccaattgagc acgaagatca aactgcaccg cctgcgccgc atattcgcca ttatgactgg 240 

aatggcgcaa tgcagccgat ggtcagtaag afcgcttgggg ctgacggggt gactgcgggt 3 00 

agcgtcctgc tggttgatag cgttaacaac cgtactaacg gttcgctgaa tgccgcagaa 3 60 

gcgaccgaaa cgctgcgaaa tgcgctggct aataacggga aatttaccct ggtttccgcc 420 

cagcagctgt cgatggctaa gcaacagtta ggtttgtcgc cgcaggatag tttaggcacc 480 

cgtagtaaag ccataggcat tgcccgcaat gfccggcgctc attacgtgct gtactccagc 540 

gcctctggca acgttaacgc tccgacccta caaatgcagc tgatgctggt gcagacgggc 600 

gaaattatct ggtcaggtaa aggtgccgtt tcgcagcaat aa 642 
<212> Type : DNA 
<211> Length : 642 

SequenceName : SEQ ID 670 

SequenceDescription : 

Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 
<400> PreSequenceString : 

atgaaaagca aagtactggc acttttaatt cctgccctgc tcggcgcagg tgctgcacat 60 

gcagccgaag tttataataa agacggcaac aaattagatc tgtatggcaa agttgatggc 120 

ctgcattatt tttctgataa ttcagcgaaa gatggcgacc agagctatgc gcgtctgggt 180 

tttaaaggcg aaacccaaat taacgatcaa cttactggct acggtcaatg ggaatacaat 240 

attcaggcaa acaacactga atcttcaaaa aaccagtcat ggacccgtct ggcatttgcc 3 00 

ggactgaaat tttcagatta cggttctttc gattacggac gtaattatgg gctggaccga 360 

tatgctgcct ga 372 
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<212> Type : DNA 

<211> Length : 372 

SequenceName : SEQ ID 671 
SequenceDe script ion : 

5 

Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 
<400> PreSequenceString : 

10 atggcaacac caaatcccct tgagccggta aaaggtgccg gtaccactct gtgggtttac 60 

aacggcaagg gtgatgctta tgcaaacccg ttgtcagacg atgacfcggca gcgactggct 12 0 

aaggtgaagg atctgacgcc gggcgagatg acggcagaac cctacgatga taactacctg 180 

gatgatgaag acgcggactg gaccgcgacc gggcaggggc agaagtctgc aggagatacc 24 0 

agttt-tacgc tggcctggaa accgggagaa gaaggtcaga- aagggcfctat aggctggttt 300 

15 gaaagcgggg atgtgcgggc ctataaaatc cgtttcccaa atggca_cggt ggatgtgttc 360 

cgtggctggg tcagcagtat cggtaaggcc gtgacggcga aagaagtgat cacccgcacg 420 

gtgaaagtca ctaacgtggg caaaccttcc gtggcggaag aacgcagcga aattacgccg 48 0 

gccactgcaa ttaaggtgac accgacatcc ggtacggtgg caaaagggaa aacaaccacc 54 0 

ctgactgttt cttttgagcc ggaaagtgca accgacaaga cgttcagagc ggtttccgcc 600 

20 gatccgtcga aagccaccat tagtgtgaaa gatatgacaa ttacgg-taaa cggcgtggcg 660 

acaggtaagg tgcagatccc tgtggtgagc ggaaatggtc agttccjccgc agtggctgaa 72 0 

gtcaccgtta ctgaagcggg cgctgcaggg taa 753 
<212> Type : DNA 
<211> Length : 753 

25 SequenceName : SEQ ID 672 

SequenceDescription : 



Sequence 



30 <213> OrganismName : Escherichia coli 0157 :H7 
<400> PreSequenceString : 

atggcaacac caaatcccct tgagccggta aaaggtgccg gtaccactct gtgggtttac 60 

aacggcaagg gtgatgctta tgcaaacccg ttgtcagacg atgactggca gcgactggct . 120 

aaggtgaagg atctgacgcc gggcgagatg acggcagaac cctacgratga taactacctg > 180 

35 gatgatgaag acgcggactg gaccgcgacc gggcaggggc agaagtctgc aggagatacc 240 

agttttacgc tggcctggaa accgggagaa gaaggtcaga aagggcrttat aggctggttt 3 00 

gaaagcgggg atgtgcgggc ctataaaatc cgtttcccaa atggcacggt ggatgtgttc 3 60 

cgtggctggg tcagcagtat cggtaaggcc gtgacggcga aagaagjtgat cacccgcacg 4r20 

gtgaaagtca ctaacgtggg caaaccttcc gtggcggaag aacgcagcga aattacgccg 480 

40 gccactgcaa ttaaggtgac accgacatcc ggtacggtgg caaaagjggaa aacaaccacc 540 

ctgacggttt cttttgagcc ggaaagtgca accgacaaga cgttcagagc ggtttccgcc 600 

gatccgtcga aagccaccat tagtgtgaaa gatatgacaa ttacgcjtaaa cggcgtggcg 660 

acaggtaagg tgcagatccc tgtggtgagc ggaaatggtc agttcg-ccgc agtggctgaa 72 0 

gtcaccgtta ctgaagcggg cgctgcaggg taa 753 

45 <212> Type : DNA 

<211> Length : 753 

SequenceName : SEQ ID 673 
SequenceDescription : 



50 Sequence 



<213> OrganismName : Escherichia coli 0157 :H7 
<400> PreSequenceString : 

atgggctgga ccgatatgct gcctgaattt ggcggtgact cttataccaa tgcagacaac 60 

55 tttatgactg gtcgtgccaa tggcgtcgcg acttatcgta atactg-attt cttcggtctg 120 

gtaaatggtc tgaacttcgc ggtgcagtat caaggtaaca acgaacjgtgc cagtaatggt 180 

caggaaggca ccaacaacgg acgtgatgtt cgccatgaaa acggtgjacgg ctggggtctt 240 

tccacaacat atgatttagg catgggcttt agcgctggtg cggcatacac ctcttctgac 300 

cgcaccaatg accaggttaa ccatactgcg gcgggtggtg ataaacjcaga cgcgtggact 360 

60 gctgggctaa aatacgatgc taacaatatt tacctggcaa ccatgtattc agaaacgcgt 420 

aatatgaccc cgtttggcga cagcgattat gctgttgcaa acaaaaccca gaactttgaa 480 

gtcactgcac agtaccagtt tgattttggt ctgcgtccgg cagtctcttt cctgatgtct 54 0 

aaaggccgtg acctgcacgc tgccggtggt gcagacaacc cggcagjgtgt tgatgataaa 600 

gatctggtta aatacgccga tgttggcgcg acttactatt tcaataaaaa catgtccact 660 

65 tatgttgact ataaaatcaa cctgttggat gaagatgaca gtttctacgc tgccaacggc 720 

atctctaccg atgatattgt cgctttaggt ctggtttatc agttctaa 768 
<212> Type : DNA 
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<211> Length : 768 

SequerxceName : SEQ ID 674 
SequenceDescription : 



5 Sequence 

<213> OrganismName : Haemophilus influenzae Rd 
<400> PreSequenceString : 

atggggttta ttatgaaact aactaaaaca gcattatgta ctgcactttt tgccaccttt 60 

10 actttttcag cgaacgcaca aacttatcct gatttgccag tggggatcaa aggaggaaca 12 0 

ggcgcattaa ttggcgacac agtttatgtg ggattaggtt ctggtggcga taaattctat 18 0 

accttagacc taaaagatcc ttcagcacaa tggaaagaaa ttgctacatt tccgggtggc 240 

gaacgcaatc aacctgttgc agccgcagtg gatggaaaac tttatgtatt cggtggttta 3 00 

.w caaaaaaatg aaaaaggcga acttcaactc gttaatgacg cttatcgcta taacecaagc, 360 

15 gataatactt ggatgaaact tcctactcgt tctccccgtg gtttagtagg atcaagtggt 42 0 

gcatctcacg gagataaagt ttatatttta ggcggctcta atctctctat atttaatggc 48 0 

ttcttccaag acactgtggc agcaggtgaa gataaagcga aaaaagatga aatcgcagca 540 

gcttatttcg atcaacgtcc agaagattat ttctttacaa cagaattatt gagctatgaa 600 

ccatcaacca ataaatggcg caatgaaggt cgtattccat tctctggtcg tgctggtgca 660 

20 gcctttacaa ttcaaggtaa tgagctagtg gttgtcaatg gcgaaattaa acctggactt 72 0 

cgcaccgctg aaacccatca aggtaaattc actgctaaag gtgtgcaatg gaaaaactta 780 

cctgacttac ctgctccaaa aggcaaatca caagatggtt tagctggtgc gctttcaggc 840 

tatagtaacg gtcattattt agtcactggc ggcgcaaatt ttccaggttc aatcaaacaa 900 

ttcaaagaag gaaaacttca cgcacataaa ggtttaagca aagcttggca taacgaagtt 960 

25 tatacgttga ataatggtaa atggcgcatt gttggggaat taccaatgaa tattggttat 102 0 

ggtttttctg tatcttacaa caataaagtt ttactgattg gcggtgaaac tgatggaggt 10 8 0 

aaggctttaa ctagcgtcaa agcaataagc tatgacggta aaaaattaac catcgaataa 114 0 



<212> Type : DNA 
30 <211> Length : 1140 

SequenceName : SEQ ID 675 
SequenceDescription : 

Sequence 

35 

<213> OrganismName : Haemophilus 
<400> PreSequenceString : 
atgggagaac aatatatgct tacgacgata 
gcgtatgttt cttggttaaa aacaaaaggg 
40 ttggcagggc gtgggttaag cggtttagta 
tcaactgaac aacttatcgg tgtcaatgcg 
gcttggacag ttccaacggt tattccgctt 
ctataa 

<212> Type : DMA 
45 <211> Length : 306 

SequenceName : SEQ ID 676 
SequenceDescription : 



influenzae Rd 

ctcagcttcc tcattgtaac cactgtggtt 60 

gatgatttaa aatcttcaaa agggtatttt 12 0 

attggctgct caatggtgct gacatcgctt 180 

gtgtcctata aaggtaattt ttccgtgatt 240 

tgtttcttgg ctctttatat aattggctgg 3 00 

306 



Sequence 
50 

<213> OrganismName : [Helicobacter pylori, strain J99 
<40 0> PreSequenceString : 

atgaaaaatc aacacaaaaa tcccctaaca aaagctttaa tgaaaactta tccatataac 60 

cattttttat ttttctgctt tattctagga gcgtttttat taggtttgct cagtccagct 120 

55 tatgctttaa gtattatcac cactaaagaa attgacgcta atttgcttaa tggagcgata 180 

gaaagcaggg tggtgttagg caagagggtg tttaaagtag aagctcatgg gttttatttt 240 

agaaacaatg cgactaacag catagatata gaaatcacca gtcttttaag agacaatcaa 300 

tcgtttcctt tgactagcag tgctaaaacc agtttaaaaa tacctcctaa cgccaagatt 3 60 

aaaaaatcca ctatccttgt tttgaaaggc gagaacgctg aagaagtggc taagatttta 420 

60 ggcgttagca aagaagaata ccaaaagcta gaaaacatcg ctcaaaccaa agcggctaat 480 

gaccctatgt atgctaacac gccttttagt aatggttctg atagttcctt ttacgataac 540 

aatcctaata gccctagcaa taacgctatc aatggcaaag atggcgcaaa tgggagtaac 600 

ggctatgggg caaatggcaa tgatggggta aatgggatca gtgggagtaa tggtgcaaat 660 

gggagtcatt caaataataa tgcaataggc agtggtattg atacagatgg cgtgttaggg 720 

65 gtggatgggg tgaatggctc tagttcttca agtggcggct ctgtaggggg ttatgagaat 780 

aatttcacta atcatggctc tactaacaat aacacaggag cjgtatgacaa ttttaataat 840 

ggcagctcaa gtggtgggag tttagggaat ggggggcttt tccctattcc ttttggtaat 900 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



ggagacacaa acaattccaa taattccact 
tctaataacg ccactaatcc tagttcgcaa 
gtgccagagt taagccccaa caacacgatg 
tgtatttcta tgaacgcttt aagagatgac 
gccggtaaag ccatcaagca aacgcaatac 
aatatcggtg gttgtgtgga tttacaaggc 
gacagcaaat gcgccttaca aaccacgagc 
acctttcaaa ctgaaatcgt gtttcgtggg 
tgcagcgatt atgcaagggt gcaagacagg 
caaaccttaa cgcctatagt ggatcagtat 
gagattttaa atcgtgggat tgccacccaa 
ggtcaatggg aatacaatga cgctaaatta 
tataacaagc ttaatggaga atgggtagaa 
. aaaagcggtg cggttgttag cccttatgtg„ 
gatattacta caagccatta ttttaggata 
tgccaaaaac tttatggagt caatcgttgc 
tcaccgattg gagcgccact tacaaaacca 
gcccagccca agataatgaa aaacacccca 
ccatcaacag gactcaaagc gttttga 
<212> Type : DNA 
<211> Length : 2007 

SequenceName : SEQ ID 677 
SequenceDescription : 

Sequence 



aacaccacta 
gaaaacaatt 
aaactagatg 
actaaatgcg 
tactatgtag 
gctcaatacg 
gataaaggtt 
atggacaatt 
attgttaggt 
tataatgatc 
ttaagctcgc 
gaagccaaaa 
gttacgccct 
atgggcgtgc 
gaaaggaaaa 
caaccgcaat 
ctaccaccca 
caacctataa 



gcccaactaa 
actccagcca 
ttatcgctaa 
cttatagata 
atagggaaaa 
ccatgcaact 
atggtatggg 
taatccatgt 
atgaaaaaaa 
ctaacaaccc 
aatatcaaga 
gacctacaat 
gtaattttga 
ctagttctaa 
attatggtga 
attccatact 
aaccactcaa 
tcttatcacc 



tggcagtagt 
gtattgtaaa 
agetfcggctct 
cgattttgaa 
taaaacgcaa 
ttacaaagat 
gaaaacgcaa 
cgctgtgcct 
tgataaaacc 
taacaagcaa 
atttgcatgc 
gcfcaaaaagc 
agcragggatt 
agtcttaagc 
gagagaacaa 
gatcctagta 
ccttatttac 
act caaacca 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgcctgtta taagagtttt agtaatgctt gcaacaatga tgatgaaatt 
gcaaaagaaa agaaagtttt taagaatgtg ggaatatcta taatggggat 
gaagcgataa aagactcgat aaaaaaacaa attaaaaaaa gcgattggat 
gttaagactg cggatgatta tttaaaaacg catcctaact catggtttaa 
ggtgtaacag cgataacagc catgcttatg aatgtgtgtt ttgctgatga 
aaagaagtgg ctcaagctca aaaggaagct gaaaacgcta gggatagagc 
gggatagaac tggaacaaga agagcaaaag acagaacaag aaaaacaaaa 
gaaaaacaaa agacagaaca agaaaaacaa aagacagaac aagaaaaaca 
caagaaaaac aaaagacaag caatatagag actaacaatc aaataaaagt 
caacaaaaga cagaacagga aaaacaaaag acaaacaata cgcaaaaaga 
aaagcagaac aaaattgcca agaaaatcat aatcaattct ttattaaaaa 
aaggctggca ttgctataga aatagaagct gaatgcaaaa cccctaaacc 
aatcaaaccc ctatccagcc aaaacacctc ccaaactcca aacaacccca 
ggatcaaaag cgcaagagct tatcgcttat ttgcaaaaag agctagaatc 
tcacaaaaag ctatcgctaa acaagtggat ttttataggc caagttctat 
gaactagatc ctagagattt taacgctaca gaagaatggc aaaaagaaaa 
cgctctaaag ctcaagctaa aatgcttgaa atgaggagtt taaaaccaga 
cacctttcaa cctctcaaag ccttttgctc gttcaaaaaa tatttgctga 
gaaataaaag tagttgctaa taccgagaaa aaagtagaaa aagcgggtta 
aaaaggatgt ag 
<212> Type : DNA 
<211> Length : 1152 

SequenceName : SEQ ID 678 

SequenceDescription : 



agtaaaaacg 
fcgotttttgg 
atgcgggaat 
ttcagcaata 
ccaatccaaa 
gaacaagagt 
gacagaacaa 
aaagacagaa 
agaacaagaa 
tttggttaac 
attaggaatt 
cacaaaaacc 
ttctcaaaga 
tctgccctat 
cgcttattta 
tttaaaaata 
cccacaagcc 
tgttagtaaa 
tggttatagt 



960 
1020 
1080 
1140 
12 0 0 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2007 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1152 



55 



60 



65 



Sequence 

<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaactacc ctaatctacc taacagcgct ttagagataa gcgaacagcc 
gaaatcacta acgagctttt aaagcaatta caaaacgctt taaggagcaa 
agcgagcaag tggaattaag ccttaaatgc atcgttagga ttttagaagt 
ttggattttt ttaagaatgc gaatgagatt gatagcagtt taagaaattc 
ctgactaacg ccggcgagag cttgaaatta aaaatgaaag aatacgagcg 
gagtttaata cgagcatgca tgccaacgag caggaagtaa ccaatacctt 
gccgagaaca ttaaaagcga aattaaaaag ctagaaaatc aattgataga 
agacttttaa cgagctatca aatcttttta aaccaagcca gagataacgc 
atcacaaaaa acaaaaccca aagccttgaa gcgattacac aagctaaaaa 
aatgaaataa gcaacaatca aacgcaagcg ataactaata tcaccgaagc 



agaagtgaaa 
cgcgcatttt 
gcttttgagt 
cat tgagtgg 
cttttttagc 
aaacgctaac 
aaccacgaca 
taacaaccaa 
caacgctaat 
gaaaacgaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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gctaataatg aaataagcaa caatcaaacg caagcgataa ctaacattaa cgaagccaaa 660 
gaaagcgcta caacgcaaat aaacgccaat aagcaagaag caataaataa catcacgcaa 720 
gaaaaaaccc aagccacaag cgagatcacc gaagcgaaaa agaccgatca ttatcaaaac 78 0 

attgattttt ttgagtttga ataa 804 
5 <212> Type : DMA 

<211> Length : 804 

SequenceName : SEQ ID 679 

SequenceDescription : 

10 Sequence 



<213> OrganismName : Helicobacter pylori, strain «J99 
<400> PreSequenceString : 
. gtgaaatttt^tttcaaagga tttgtttaaa aaagtaactc ctttatttt.t ...aagcgttt at - 50 

15 tttttaagcc ccacccttac gcaagccaaa agccgttttt atgtggcttc tcaataccag 12 0 

gtggggaaaa tgatcatgaa aaaatacaac gatctcaaac gcacgattga aggggcgagc 180 
ttttctttag gctgggagat taaccccact aattactggt tttattcgcg ctattacfctt 240 
tttatggatt acgggaatgt cattctcaat aaaagaacgg gcgctcaagc gaacatgfctc 30 0 

acttacggct ttggagggga tttgatcatg gaatacaata aaaacccttt gtatgtattt 3 60 

20 tctctttttt atggcatgca agttgctgaa aacacatgga cgatttccaa acacagcgcg 42 0 

aatttcatca ttgacgactg gcgcagcatt caagggtttt cgctcaaaac ttcaaatttc 48 0 

aggatgttgg gtttagtggg gtttaaattc caaaccgtgc tattccacca tgacgctagt 54 0 

attgaagtgg ggatcaaatg gccttttgct tttgaatacg actcaccctt tgtaaggctt 60 0 

ttttctgtct ttatttcgca cactttctac ctttaa 63 6 

25 <212> Type : DNA 

<211> Length : 63 6 

SequenceName : SEQ ID 68 0 
SequenceDescription : 

30 Sequence 



<213> OrganismName : Helicobacter pylori, strain J99 
<400> PreSequenceString : 

atgaaaaaat ttaccctatc gctatttttg tgttgcacct tacttaacgc tgaagaggrat 6 0 

35 atttttagga acaacacaaa tgaaactgat cttacaaatt cttttgaaca tggcaaagaa 12 0 

aacaacaatc ttatcccagc aaaatccgat agtttagaaa gtttcaaaga acaagaaaac 18 0 

aaagaaaaag ccaaacagct tatggattta aaggccttac agagcgtgta tttttctaaa 240 
aatagaaaat tgcaagacaa taatttcaat gtcttgtatg tggcaggcaa caccaacaaa 3 00 

atccgcttac gctatgcgat gactaccacc tttatttttg ataatgatcc tattatctat 360 
40 gtgagtttag gagatcctag cgattttgaa ctcacttacc ccactaatga tcattacgjat 420 
ttgtctaaca tgctagtgat taagccgtta cttatagggg tggatacgaa cctaaccgjta 480 
gtcggagcga gcggcacaat ttatacctta ttatttgttt ag 522 
<212> Type : DMA 
<211> Length : 522 
45 SequenceName : SEQ ID 681 

SequenceDescription : 

Sequence 



50 <213> OrganismName : Mycoplasma pneumoniae 
<4 00> PreSequenceString : 

gtgttggatt atgtgccctg gattgggaat gggtacaggt atgggaataa ccaccggcjgg 60 

agcaacagta gtacgagtgg ggtaacgacc cagggacaat cccaaaatgc atccagtaac 12 0 

gaacccgcac cgacgttttc gaatgtcggc gttggtctca aagccaatgt caacggca.cc 18 0 

55 ttaagtggtt cgcgaactac gcctaatcag caaggcactc cctggttaac cctcgaccaa 24 0 

gccaacctcc agctctgaac gggcgcgggg tgaaggaatg ataagaacgg acaaagtgac 3 00 

gaaaactaca ccaatttcgc gagtgctaag ggcagtacga accagcaggg ttcaacaacg 360 

ggtggatcag cgggcaaccc cgactcgtta aagcaggata aggctgataa aagtggtgjat 42 0 

tcggtaacgg ttgcagaagc cacatcgggc gacaacttga cgaattacac caacctcccc 480 

60 ccaacatcac ccccacatcc gactgaccga acgcgctgtc attcaccaac aagaacaacg 540 

cccagcgggt gcagctgttc ctgcgcggcc tgttgggcag catcccggtg ttggtcaata 60 0 

agagtgggca agatgataac agtaagttta attccaccga ccaaaaatgg tcttacaccg 660 

aattaa 666 
<212> Type : DNA 

65 <211> Length : 666 

SequenceName : SEQ ID 682 
SequenceDescription : 
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Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
5 <4 00> PreSequenceString : 

gtggatgata taaccgcgcc tcaaaccagc gcggggtcgt ccagcggaac tagtacgaac 60 

acaagtggtt cgcgttcctt tctcccgacg ttttcgaatg tcggcgttgg cctcaaagcg 120 

aatgtccagg gcaccctcgg gggcagacag acgacgacta cggggaacaa tattcccaaa 180 

tgagccaccc tcgaccaagc caacctccag ctctgaacgg gggcggggtg aaggaatgat 240 

10 aagactacaa gtggttcaac cggcaatgcc aatgacacca agttcacgag tgctacgggc 3 00 

agtgggagtg ggcagggcag ttcttcaggt acaaatactt ccgcggggaa tcctgatggg 3 60 

cttcaggctg ataaagttga tcaaaacggt caggtgaaaa caagcgttca agaagccact 420 

tcaggggaca acttgacgaa ttacaccaac ctcccccccg ccaacctcac ccccaccgct 480 

gattgaccga acgcgctgtc attcaccaac aagaacaacg .cgcagcgcgc ccagctgttc,.. 540 

15 ctgcgcggcc tgttgggcag catcccggtg ttggttaata agtccggcca agatgataac 600 

agtaagttta aggcggagga ccaaaaatgg tcctacaccg acttacagtc ggaccaaacc 660 

aaactgaacc tccccgctta cggcgaggtg aatgggttgt tgaatccggc gttggtggaa 72 0 

acctattttg ggaacacgcg agcgagtggt tcggggtcca acacgaccag ttcacccggt 780 

atcggtttta aaattcccga acaaagtggc acaaacacaa cgtcgaaggc tgtgctgatc 840 

20 acccccgggt tggcttgaac gccgcaagac gttggtaaca tcgttgtcag tggcaccagc 900 

ttcagcttcc agctcggcgg gtggttagtt acgttcacgg actttatcaa accccgcgct 960 

ggttacctcg ggctccagtt aacgggtctc gatgtaagtg aagcgaccca aagggagtta 102 0 

atttgggcca agcggccctg agcggccttt cgtggcagtt gggtcaaccg gctgggccgc 1080 

gtggagagtg tgtgggattt caagggggtg tgggcggatc aagctcagtt ggccgcgcaa 1140 

25 gcagctacaa gtagtaccac caccaccgca acaggggcta ccttaccgga gcacccgaat 1200 

gccctcgcgt accaaattag ctataccgac aaggattcgt acaaggcttc cactcaaggt 1260 

tcgggtcaaa ccaattccca aaacaattcg ccctacctcc attttattaa acctaagaaa 1320 

gtcgaaagca cgacccaact cgaccagggc ttaaaaaacc tgttggaccc caaccaggtt 13 8 0 

cgcaccaagc tgcgccaaag ctttggtaca gaccattcca cccagcccca gccccaatcg 144 0 

30 ctcaaaacaa cgacaccggt atttgggagg agtagtggta acctcagtag fcgtgtttagt 1500 

ggtgggggtg ctggaggggg ttcttcaggc tcaggtcaat ctggcgtgga tctctccccc 1560 

gttgaacggg tgagtggtca ctaa 1584 
<212> Type : DNA 
<211> Length : 1584 

35 SequenceHame r SEQ ID 683 

SequenceDescription : 

Sequence 



40 <213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

gtgttaaaac ttgctgttgg tatttttatt tcccccacgc tcaccaggtt tagtaccggg 60 

ttcaacctcg cggggtcggt gctcgaccag gtgttggatt atgtgccctg gattgggaat 120 

gggcacaggt atgggaataa ccaccggggc gtggatgata taaccgcgcc taaaaccggc 180 

45 gcggggtcgt ccagcggaac tagtacgaac acaagtggtt cgcgttcctt tctcccgacg 240 

ttctcgaatg tcggcgttgg cctcaaagcg aatgtccagg gcaccctggg tggcagtcag 3 00 

acgacgacta cggggaaaga tattcccaaa tgacccaccc tcgacccagc caacctccag 3 60 

ctctgaacgg gggcggggtg aaggaatgat aaggcttcaa acaaacaaag tgacgaaaac 420 

cacaccacct ttaaaagcgc tacgggcagt ggccagcagg gtggttctac aacgggtggt 480 

50 tcagcgggca atcccgactc gttaaagcag gataagatta gtaaatcagg tcagaactta 540 

accacgcagg acggcgcgcc ccagtctaat tcgacgacgg aatccgcgtc gaattatgat 600 

cacctccccc ccaacctcac ccccacatcc gattgaccga acgcgctgtc attcaccaac 660 

aagaacaacg cgcagcgcgc ccagctcttc ctccgcggct tgttgggcag catcccggtg 720 

ttggtgaatc gaagtgggtc agatgattcc aacaaattcc aagccaccga ccaaaaatgg 780 

55 tcttacaccg acttaaagtc ggaccaaacc aagctcaacc tccccgctta fcggtgaagtg 840 

aatgggttgt tgaatccggc gttggtggaa acctattttg ggacgacgcg agcgggtggt 900 

tcggggtcca acacgaccag ttcacccggt atcggtttta aaattcccga acaaaataat 960 

gattcgaagg ctgtgctgat cacccccggg ttggcttgaa cgccgcaaga cgttggtaac 102 0 

ctcgttgtca gtggtaccag cttgagcttc cagttgggcg ggtggctggt caccttcacg 1080 

60 gactttgtca aaccccgcgc gggttacctc gggctccagt taacgggctt ggatgcaagt 1140 

gatgcgacgc agcgcgccct catttgggcc aagcggccct gagcggcctt tcgtggcagt 1200 

tgggtcaacc gcttgggccg ggtcgagagt gtgtgggatt taaagggggt gtgacaagat 12 60 

caagctcagg cggccgcgca agcagctacc accgccgccg caacagggga cgccttaccg 1320 

gagcacccca atgccctcgc gtaccaaatt agctccaccg acaaggattc gtacaaggct 13 80 

65 tccactcaaa gctccggtca aaccaattcc caaaacacct ccccctacct ccatttgatt 1440 

aaacctaaga aagtcgaaaa cacgacccaa ctcgaccagg gcttaaaaac ctgttggacc 1500 

ccaaccaggt tcgcaccaag ctgcgccaaa gctttggtac agaccattcc acccaagcca 1560 
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aaccccaatc cctcaaaaca accacaccgg tgtttgggac gaatagtggt aacattggca 1620 

gtgtgcttag tggtgggggt gctggaggag cagacagcac caattcggtg gacctctccc 1680 

ccgttgaacg ggtgagtggg tggcttgtgg ggcaattacc cagtgggggt ggggggaata 1740 

gtagtgagga tattaaaagt gtgcaagaca ctcctattta tatctatatt cattagtata 18 00 

5 ttctttttaa attgttccct cactttattc atatgaacca ccgcctccct cgcgacggga 18 60 

ctcaccgtag tgggacactt cacaagtacc accacgacgc tcaagcgcca gcaatttagc 1920 

tacacccgcc ctgacgaggt cgcgctgcgc cacaccaatg ccatcaaccc gcgcttaacc 1980 

ccgtgaacgt atcgtaacac gagcttttcg tccctccccc tcacgggtga aaaccccggg 2040 

gcgtgggcct tagtgcgcga caacaccgcc aagggcatca ctgccggcag tggcagtcaa 2100 

10 caaaccacgt atgatcccac ccgaaccgaa gcggctttga ccaccgccac cacctttgtg 2160 

ttacgccgat acgacctcgc cgggcgttgt acgacctcga cttttcgaag ttaa 2214 

<212> Type : DNA 
- <211> Length :-2214 
15 SequenceName : SEQ ID 684 

SequenceDe script ion : 

Sequence 



20 <213> Organi sitiName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

gtgttggatt atattccctg gattgggaat gggcacagat atgggaatga ccaccggggt 60 

agcaacagta gtacgagtgg ggtaacaacc cagggacaac aatcgcaaaa tgcatccgga 120 

accgaacccg catccacttt ttcgaatgtc ggcgttggcc tcaaagcgaa tgtccagggc 180 

25 accctgggtg gcagtcagac gacgactacg gggaaagata ttcctaaatg acccaccctc 240 

gaccaagcca acctccagct ctgaacgggg gcggggtggc ggaatgataa ggcttcaagt 3 00 

ggacaaagtg acgaaaacca caccaagttc acgagcgcta cgggcagtgg gcagcagggc 360 

agttcttcag gtacaactaa ttccgcgggc aatcccgact cgttaaagca ggataaggtt 420 

gataaaagtg gtgattcggt aacggttgca gaaaccactt caggggacaa cttgacgaat 480 

30 tacaccaacc tccctcccaa cctcaccccc accgctgatt gaccgaacgc gctgtcattc 540 

accaacaaga acaacgcgca gcgcgcccag ctcttcctcc gcgccctgtt gggcagcatc 600 

ccggtgttgg tgaataagag tggccaagat gattccaaca agttccaagc caccgaccaa 660 

aaatggfcctt acaccgaatt aaagtcggat caaaccaagc tcaacctccc cgcttacggg 720 

gaggtgaatg ggttgttgaa fcccggcgttg gtggaggtgfc acggtctgag ttccactcaa 780 

35 ggttctagca ccggagccgg tggcgctgga ggtaacaccg gaggcgacac caatacccag 840 

acttatgcac gacccggaat cggctttaaa ttaccctcca cggactcgga atcttccaaa 900 

gccaccctga tcacccccgg gttggcttga acagcgcaag atgtcggtaa cctcgttgtc 960 

agtggtacca gcttgagctt ccagctcggc gggtggttgg ttaccttcac ggattttatc 1020 

aaaccccggt cgggttacct cgggctccag ttaacgggct tggatgcaaa tgacagcgac 10 80 

40 caaagggagt taafcttgagc ccccccggcc ctgaaccgcc tttcgtggca gttgggtcaa 1140 

ccgcttgggc cgcgtggaga gtgtgtggga tttcaagggg gtgtgggcgg atcaagctcg 12 00 

gtccgactcg caagcagcta caagtaccac caccgcaacg aaggctacct tatcggagca 12 60 

caccaatgct ttggcctttc aggtgagtta taccgaccag gattcgtaca aggcttccac 13 20 

tcaaagctcc ggccaaaacc aaaacacctc cccctacctg cacttggtgc aggggaaaaa 13 80 

45 agtaggttcc tctgataa 13 98 
<212> Type : DNA 
<211> Length : 1398 

SequenceName : SEQ ID 685 
SequenceDescription : 

50 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

55 ttgttgggca gcatcccggt gttggtgaat cgaagtgggt ccgattccaa caaattccaa 60 

gccaccgacc aaaaatggtc ctacaccgac ttacagtcgg accaaaccaa actgaacctc 120 

tccgcttacg gtgaggtgaa tgggttgttg aatccggctt tggtagaaac ctattttggg 180 

acgacgcgta cgtctagcac tgcgaaccaa aacagtacaa ccgtccccgg tatcggtttt 240 

aaaattcccg aacaaaataa tgattccaaa gccaccctga tcacccccgg gttggcttga 3 00 

60 acgccccagg acgtcggtaa cctcgttgtc agtggcacca cggtgagctt ccagctcggc 360 

gggtggctgg tcaccttcac ggactttgtc aaaccccgcg cgggttacct cgggctccag 420 

ttaagtggcc tgaatgccag tgacagcgac caaagggagt taatttgggc cccccggccc 480 

tgagcggcct ttcgtggcag ttgggtcaac cggttgggcc gcgtggagag tgtgtgggat 540 

ttgaaggggg tgtgggcgga tcaagctcag ttggccgcgc aagcagctac aagtagtacc 600 

65 accaccaccg caacaggggc taccttaccg gagcacccga atgctttggc gtaccaaatt 660 

agctataccg acaaggattc gtacaaggct tccactcaag gttcgggtca aaccaattcc 720 

caaaacaatt cgctctacct ccatttgatt aaacctaaga aagtcgaaag cacgacccaa 780 
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ctcgaccagg gcttaaaaaa cctgttggac 
agctttggta cagaccattc cacccagccc 
gtgtttggag ccatgagtgg taacctcggc 
gcaggcagca ccaattcggt ggacctctcc 
attaatagga atttttctta ttaa 
<212> Type : DNA 
<211> Length : 1044 

SequenceName : SEQ ID 686 

SequenceDe script ion : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<40 0> Pre§equenceS£ring,„ : 

atgggccagc agggacaatc aggtacctcc gcggggaatc cagactcgtt aaagcaggat 60 

aagattagta agagtgggga tagtttaacc acgcaggacg gcaatgcgac cggtcaacaa 12 0 

gaggccacta actacaccaa cctccccccc aacctcaccc ccaccgctga ttgaccgaac 180 

gcgctgtcat tcaccaacaa gaacaacgcg catcgcgccc agctcttcct tcgcggcttg 240 

ttgggcagca tcccggtgtt ggtgaatcga agtgggtccg attccaacaa attccaagcc 3 00 

accgaccaaa aatggtccta caccgactta cagtcggatc aaaccaagct gaacctcccc 3 60 

gcttacggtg aggtgaatgg gttgttgaat ccggcgttgg tggaaaccta ttttgggaac 420 

acgcgagcgg gtggttcggg gtccaacacg accagttcac ccggtatcgg ttttaaaatt 480 

cccgaacaaa ataatgattc caaagccacc ctgatcaccc ccgggttggc ttgaacgccc 540 

caggacgtcg gtaacctcgt tgtcagtggc accagcttga gcttccagct cggcgggtgg 600 

ctggtcagct tcacggactt tatcaaaccc cgcgcgggtt atttgggcct ccagttaagt 660 

ggcctggatg ccagtgacag cgaccaaagg gagttaattt gggccaagcg gccctgagcg 720 

gcctttcgtg gcagttgggt caaccggctg ggccgcgtgg agagtgtgtg ggatttaaag 780 

ggggtgtggg cggatcaagc tcagttggcc gcgcaagcag ctacaagtga agcttccggg 840 

tcagctttgg cacctcaccc gaatgctttg gcgtttcagg tgagtgtggt ggaagcgagt 900 

gcttacagct cttcaacctc aagttcgggt tccgggtcaa gttcaaacac ctccccctac 960 

ctccatttga ttaaacctaa gaaagtcgaa agcacgaccc aactcgacca gggcttaaaa 1020 

aacctgttgg accccaacca ggttcgcacc aagctgcgcc aaagctttgg tacagaccat 108 0 

tccacccagc cccaatcgct caaaacaacg acaccggtat ttgggacgag tagtggtaac 1140 

attggcagtg tgcttagtgg tgggggtgct ggagggggtt cttcaggctc aggtcaatct 120 0 

ggtgtggacc tctcccccgt tgaacgggtg agtggtcact aa 1242 
<212> Type : DNA 
<211> Length : 1242 

SequenceName : SEQ ID 687 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 
ttgggcctcc agttaagtgg cctggatgcc 
gccaagcggc cctgagcggc atttcgtggc 
agtgtgtggg atttgaaggg ggtgtgggcg 
caagcagcta caagtagtac caccaccacc 
aatgccctcg cgtaccaaat tagctccacc 
ggctccggtc aaaccaattc ccaaaacacc 
aaagttactg cttccgacaa gttagacgac 
gttcgggtga agctgcgcca aagctttggt 
cccctcaaaa caacgacacc ggtgtttggg 
agtggtgggg gtaccacgca ggactcaagc 
gtgagtgggt ggcttgtggg gcagttacca 
aacaacctcg cgcctaatac taatacgggg 
aagcgagctt ccattgaatc aagtcgtttg 

<212> Type : DNA 
<211> Length : 771 

SequenceName : SEQ ID 688 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<40 0> PreSequenceString : 



cccaaccagg ttcgcaccaa 
cagccccaat cgctcaaaac 
agtgtgctta gtggtggggg 
cccgttgaac gggtgagtgg 



gctgcgccaa 840 
aacgacaccg 900 
tgctggagga 960 
gtcactaacc 102 0 
1044 



agtgacagcg accaaaggga 
agttgggtca accggttggg 
gatcaagctc acagcgcggt 
gcaacagggg acaccttacc 
gacaaggatt cgtacaaggc 
tccccctacc tgcatttgat 
gatcttaaaa acctgttgga 
acagaccatt ccacccaacc 
acgaatagtg gtaacctcgg 
accaccaatc aactgtcacc 
agcacgagtg acggaaacac 
aatgaggtgg tgggggtggg 
tgaatagcat taaaacctta 



gttaatttgg 60 

acgggtcgag 120 

gtccgagtcg 180 

ggagcacccg 240 

ttccactcaa 3 00 

taaacctaaa 3 60 

ccccaacgag 42 0 

ccaaccccaa 480 

tagtgtgctt 540 

cgttcaacgg 600 

ctcctccacc 660 

agatttgtct 72 0 

g 771 
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gtgcgcgata acactgccaa gggcatcact 
gatcctgcgc gaaccgaggc caccttgacc 
ctcgccgggc gcgccttata cgacctcgac 
cgtgatgcca actgccagat cacctttaac 
5 gcaccccaac agtgaaacga ggtcaaaaac 
accgatcctt atcggtttgc cgttttactc 
ttgcaaaggg gattagcgct ccctaaccaa 
accaccattg gcgcgtatgg gctgaaggtg 
aatgaaaaac tccagggcga tgaatccaag 

10 accacccaac gtggttcgac caattccgac 
aaaaagaaat cggactcgga ggacaatggt 
aacgctccca ttaagcgggg cgaggagtcg 
tttggtactg ccccttccag ttcgggatca 
ccctgaaggc cgtggcttgc gactgagcaa 

15 tcgatcctga ttctgtacga tgcgccttat 
gatcacttgg atcccaaggt gatgaccgcg 
tgaaaccacc acgggttgtg ggactgaaag 
tttgatgaaa gtaatacttc gaacaccaag 
aagtcggccc ccatcgccct cccgtttgaa 

20 tggttcgggc aagcgctttt ggtgtttggt 
accgcgcctt tgagtatttg gctttatata 
cttttagcta actctttatt atcaaaaagc 



gccggcagtg gcagtcaaca aaccacgtat 60 

accaccacct ttgcgctgcg ccggtatgac 120 

ttttcgaagt taaacccaca aacgccaacg 180 

ccctttggcg gctttggttt gagtggcagt 240 

aaggtccccg tcgaggtggc ccaagacccc 3 00 

gtgccgcgca gtgtggtgta ctatgagcag 360 

gggagttcgt caggctccgg tcaacaaaac 420 

aagaacgccg aggcggacac cgcaaagagc 480 

tcttccaatg gatcttcaag cacttccacc 540 
accaaagtca aggctttaaa aatagaggtg 600 

cagctgcagt tagaaaaaaa tgatctcgcc 660 
ggtcagtccg tccaactcaa ggcggacgat 720 
ggcggcaact ccaaccccgg ttcccccacc 780 
attcacaagg acctccccaa atgatccgcc< • 840 
gcgcgcaatc gtaccgccat tgatcgcgtt 900 
aactatccgc ccagttgaag aatgcccaag 960 

gcgcgcgatg ttttgttcca aaccaccggg 1020 

cagggctttc aaaaggaagc tgactccgac 10 80 

gcgtacttcg ccaacattgg caacctcacc 1140 

ggcaatggcc atgttaccaa gtcggcccac 1200 

tatttagtaa aggcagtcac ttttaggttg 1260 

aacatatata aaaaaacagc taattaa 1317 



<212> Type : DNA 
25 <211> Length : 1317 

SequenceName SEQ ID 689 
SequenceDescription : 



30 



35 



40 



45 



50 



55 



Sequence 

<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

gtgcgcgata acattgccaa gggcatcact gccggtagta acacccaaca aaccacgtat 
gatcccaccc gaaccgaggc caccttgacc accgccacca cctttgcgtt acgccggtat 
gacctcgccg ggcgcgcctt atacgacctc gatttttcga agttaaaccc acaaacgccc 
acgcgcgacc aaaccggtca gatcaccttt aacccctttg gcggctttgg gctgagtggg 
gctgcacccc aacagtgaaa cgaggtcaag gataaggtcc ccgtcgaggt ggcccaagac 
ccctccaatc cttatcggtt tgccgtttta ctcgtgccgc gtagcgtggt gtactatgag 
cagttgcagc gggggttagc gctccctaac caagggagtt cgtcaggctc cggtcaacaa 
aacaccacca ttggcgcgta tgggctgaag gtgaaaaacg ccgaggcgga caccgcgaag 
agcaatgaaa aactccaggg ctatgaatcc aagtcttcca atggatcttc aagcacttcc 
accacccaac gtgggggttc gtcaaatgaa aacaaagtca aggcgttgca ggtggcggtg 
aaaaagaaat ccgggagtca gggcaactcc ggtgaccaag gcaccgaaca ggtggaactt 
gaatctaatg atttagccaa cgccccgatt aaacgaggct ccaataacaa ccagcaagtc 
caactcaagg cggacgattt tggtactgcc ccttccagtt cgggatcagg cacccaagat 
ggcaccccca ccccctgaac gccgtggtta acgactgagc aaattcacaa cgaccccgcc 
aaattcgccg cctcgatcct cattctgtac gatgcgcctt atgcacgcaa ccgtaccgcc 
attgaccgcg ttgatcactt ggatcccaag gtgatgaccg cgaactatcc gcccagttga 
agaacgccca agtgaaacca ccacggtttg tgggactgaa aggcgcgcga tgttttgctc 
caaaccaccg ggttcttcaa cccgcgccgc caccccgagt ggtttgatgg cgggcagacg 
gtcgcggata acgaaaagac cgggtttgat gtggataact ccgaaaacac caagcagggc 
tttcaaaagg aagctgactc cgacaagtcg gccccgatcg ccctcccgtt tgaagcgtac 
ttcgccaaca ttggcaacct cacctggttc gagcaagcgc ttttggtgtt tgggatttgt 
ttgtcttaa 
<212> Type : DNA 
<211> Length : 1269 

SequenceName : SEQ ID 690 

SequenceDescription : 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1269 



60 



Sequence 



<213> OrganismName -. Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgctttggc ctttcaggtg agtgtggtgg aagcgagtgc ttacaagcca aacacgagct 60 

65 ccggccaaac ccaatccact aacagttccc cctacctgca cttggtgaag cctaagaaag 12 0 

ttacccaatc cgacaaagtt agacgacgat cttaaaaacc tgttggaccc caacgaggtt 180 

cgcgccagaa tgctcaaatc atttggtaca gaaaatttca cccaacccca accccaaccc 24 0 
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caagccctca aaacaacgac accggtattt gggacgagta gtggtaacct cggtagtgtg 3 00 

cttagtggtg gggggtacca cgcagggctc aagcaccacc aatcaactgt cacccgttca 3 60 

acgggtgagt gggtggaccg ctaa 384 
<212> Type : DNA 
<211> Length : 384 

SequenceName : SEQ ID 691 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<4 00> PreSequenceString : 

gtgcgcgaca acagcgccaa gggcatcact gccggtagtg aaagtcaaca aaccacgtat 60 

gatcccactc gaaccgaagc ggctttgacc gcatcaacca cctttgcgtt acgccggtat 12 0 

gacctcgccg ggcgcgcctt atacgacctc gactttagta ggttaaaccc acagacacca 180 

acgcgcgacc aaaccgggca gatcaccttt aacccctttg gcggctttgg tttgagtggg 240 

gctgcacccc aacagtgaaa cgaggtcaaa aacaaggtcc ccgtcgaggt ggcccaagac 3 00 

ccctccaatc cttatcggtt tgccgtttta ctcgtgccgc gtagcgtggt gtactatgag 360 

cagttgcagc gggggttagc gctccctaac caagggagtt cgtcaggctc cggtcaacaa 420 

aacaccacca ttggcgcgta tgggctgaag gtgaagaacg ccgaggcgga caccgcgaag 480 

agcaatgaaa aactccaggg cgatgaatcc aagtcttcca atggatcttc aagcacttcc 540 

accaccaccc aacgtggggg ttcgtcaggg gacaccaaag tcaaggcgtt gcaggtggcg 600 

gtgaaaaaga aatccgggag tcagggcaac tccggtgaac aaggcaccga acaggtggaa 660 

cttgaatcta atgatctcgc caacgctccc attaagcggg gcgaggagtc gggtcagtcc 720 

gtccaactaa aggcagccga cttcggcacc accccatcca gttcgggatc aggcggcaac 780 

tccaaccccg gttcccccac cccctgaagg ccgtggcttg cgaccgagca aattcacaag 840 

gacctcccca aatgatccgc ctcgatcctg attctgtacg atgcgcctta tgcgcgcaat 900 

cgtaccgcca ttgatcgcgt tgatcacttg gatcccaagg tgatgaccgc gaactatccg 960 

cccagttgaa gaacgcccaa gtgaaaccac cacgggttgt gggactgaaa ggccagagat 102 0 

gttttgctcc aaaccaccgg gttcttcaac tcgcggcgcc accccgagtg gtttgaccag 1080 

ggccaagcgg tcgcggataa tacccaaacc gggtttgata cagatgacac cgataataaa 1140 

aaaacaaggc tttcaaaagg aagctgactc cgacaagccg gccccgatcg ccctcccgtt 1200 

tgaagcgtac ttcgccaaca ttggcaacct < cacctggttc gggcaagcgc ttttggtgtt 1260 

tgggatttgt ttgtcttaat taactaa ~ 1287 
<212> Type : DNA 
<211> Length : 1287 

SequenceName : SEQ ID 692 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<40 0> PreSequenceString : 

atgtttggct tgaaggtgaa gaacgccgag gcggacaccg cgaagagcaa tgaaaaactc 60 

cagggcgctg aggccactgg ttcttcaacc acatctggat ctggccaatc cacccaacgt 12 0 

gggggttcgt caggggacac caaagtcaag gcgttgcagg tggcggtgaa aaagaaatcc 18 0 

gggagtcagg gcaactccgg tgaccaaggc accgaacagg tggaacttga atctaatgat 240 

ttagccaacg ccccgattaa acggggctcc aatccagcaa gtccaactca aggcagccga 3 00 

cttcggcacc accccatcca gttcggaatc tggtcaatca ggcaccccca ccccctgaag 3 60 

gccgtggctt gcgaccgagc aaattcacaa ggacctcccc aaatgatccg cctcgatcct 420 

cattctgtac gatgcgcctt atgcctttaa 450 
<212> Type : DNA 
<211> Length : 450 

SequenceName : SEQ ID 693 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgtttggct tgaaggtgaa ggatgcaacc gtggatagtt cgaagcaatc aacggaaagc 60 

ttaaagggcg aagaatcgag ttccagttcc accacatctt ccacctccac cacccaacgt 12 0 

gggggttcgt caggggacac caaagtcaag gcgttgcagg tggcggtgaa aaagaaatcg 180 

gactcggagg acaatggtca gatcgaactt gaaaccaaca acctcgccaa cgccccgatt 24 0 

aaacggggct ccaataacaa ccagcaagtc caactcaagg cggacgattt tggtacttcc 3 00 

ccttccagtt cggaatctgg tcaatcaggc acccccaccc cctgaacgcc gtggcttgcg 360 

accgagcaaa ttcacaagga cctccccaaa tgatccgcct cgatcctgat tctgtacgat 420 
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gcgccttatg cgcgcaatcg taccgccatt gatcgcgttg atcacttgga tcccaaggtg 480 
atgaccgcga actatccgcc cagttgaaga acgcccaagt gaaaccacca cgggttgtgg 540 
gactgaaagg cgcgcgatgt cctggtccaa accaccgggt tcttcaaccc gcgccgccac 60 0 

cccgattggt ttgaccaggg ccaagcggtc gcggagaata cccaaaccgg gtttgataca 660 
gatgacaccg ataataaaaa gcagggcttt cgcaaacaag gcgaacaate ccctgccccc 720 
atcgccctcc cgtttgaagc gtacttcgcc aacattggca acctcacctg gttcgggcaa 780 
gcgcttttgg tgtttgggat ttgtttgtct taa 813 
<212> Type : DNA 
<211> Length : 813 

SequenceName : SEQ ID 694 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> Pre Sequences t ring : 

atgggcagtc aaaaccaggg ttctacaaca actacctcag cgggcaatcc cgattcttta 60 

gttactgata aagttgatca aaaaggtcag gtgcaaacaa gtggtcaaaa tttaagtgat 120 

accaactaca ccaacctctc ccccaacttc acccccacat ccgactgacc gaacgcgctc 180 

agcttcacca acaagaacaa cgcgcagcgc gcccagctgt tcctgcacgg cttgttgggc 240 

agcatcccgg tgttggttaa taagagtggc gaaaataacg aaaaattcca agccaccgac 3 00 

caaaaatggt cctacaccga attaaagtcg gatcaaacca agctcaacct ccccgcttat 3 60 

ggtgaggtga atgggctgtt gaatccggcg ttggtggaaa cctattttgg gacgacgcgt 420 

acgtctagca ctgcgaacca aaacagtaca accgtccccg gtatcggttt taaaattccc 480 

gaacaaaata atgattcaaa ggctgtgctg atcacccccg ggttggcttg aacgccccag 540 

gacgttggta acctcgttgt cagtggcacc agcttcagct tccagctcgg cgggtggctg 600 

gtcagcttca cggactttgt caaaccccgc gcgggttacc tcggactcca gttaacgggc 660 

ttggatgcaa gtgatgcgac gcagcgcgcc ctcatttggg cccccccggc cctgagcggc 720 

ctttcgtggc agttgggtca accggttggg ccgcgtggag agtgtgtggg atttgaaggg 780 

Sgtgtgggcg gatcaagctc agtccgactc gcaaggatct accaccaccg caacaggggc 840 

taccttaccg gagcacccga atgctttggc ctttcaggtg agtgtggtgg aagcgagtgc 900 

ttacaagcca aacacgagct ccggccaaac ccaatccact aa 942 
<212> Type ; DNA 
<211> Length r 942 

SequenceName r SEQ ID 695 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycoplasma pneumoniae 
<400> PreSequenceString : 

atgagttttg gtttagtggg cactgttaat aataacggtt ggaaatcgcc gtttcgacat 60 

gaaacaaaat accgagcagg gtatgataag ttcaagtatt acaaaaccca ctaccggggt 120 

gcaaaaaaag ctggaaccaa tgatgatcga tggagatgaa ctgcttggtt tgaccttgac 180 

tttgcccacc aaaagatagt gctcattgaa aggggtgaac ttcaccgtca agcagattta 240 

aaaaaatctg accccgcaac aaacgaaact tccaaaaccg tttggggtag cattaaagaa 3 00 

aagctgttac aaaacgtcaa caacctccac tcagagaaag gagtattctt atggttccgt 3 60 

caatctggtt ttacaactac tagaaactag 3 90 
<212> Type : DNA 
<211> Length : 390 

SequenceName : SEQ ID 696 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atggctgaac cgttggccgt cgatcccacc ggcttgagcg cagcggccgc gaaattggcc 60 

ggcctcgttt ttccgcagcc tccggcgccg atcgcggtca gcggaacgga ttcggtggta 120 

gcagcaatca acgagaccat gccaagcatc gaatcgctgg tcagtgacgg gctgcccggc 180 

gtgaaagccg ccctgactcg aacagcatcc aacatgaacg cggcggcgga cgtctatgcg 240 

aagaccgatc agtcactggg aaccagtttg agccagtatg cattcggctc gtcgggcgaa 3 00 

ggcctggctg gcgtcgcctc ggtcggtggt cagccaagtc aggctaccca gctgctgagc 3 60 

acacccgtgt cacaggtcac gacccagctc ggcgagacgg ccgctgagct ggcaccccgt 420 

gttgttgcga cggtgccgca actcgttcag ctggctccgc acgccgttca gatgtcgcaa 480 

aacgcatccc ccatcgctca gacgatcagt caaaccgccc aacaggccgc ccagagcgcg 540 

cagggcggca gcggcccaat gcccgcacag cttgccagcg ctgaaaaacc ggccaccgag 600 
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caagcggagc cggtccacga agtgacaaac gacgatcagg gcgaccaggg cgsicgtgcag 6 60 

ccggccgagg tcgttgccgc ggcacgtgac gaaggcgccg gcgcatcacc ggcgccagcag 72 0 

cccggcgggg gcgttcccgc gcaagccatg gataccggag ccggtgcccg cccagcggcg 780 

agtccgctgg cggcccccgt cgatccgtcg actccggcac cctcaacaac cacaacgttg 840 

5 tag ~ " 843 



<212> Type : DNA 

<211> Length : 843 

SequenceName : SEQ ID 697 
SequenceDescription : 

10 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
■<4 00> Pr ©Sequences tring : , 

15 atgcgttatc tgatagcgac cgcagtgctc gttgctgtgg tcctggtggg ct^gccggcg 60 

gctggtgcgc cgccgtcatg cgccggcctg ggcggcactg tgcaggccgg ccagatctgc 120 

catgtgcacg cctcgggccc taagtacatg ctggatatga catttcctgt cgactatccc 180 

gaccagcagg cgctgaccga ctacatcacg caaaaccgcg acgggttcgt caacgtcgcg 24 0 

caggggtccc cgctgcgaga ccagccctac caaatggacg ccaccagcga acagcacagc 3 00 

20 tccggccagc cgccgcaggc cacccgcagc gtagtgctca aattcttcca ggacctcggt 360 

ggggcacatc cgtccacctg gtacaaggcc ttcaactaca acctcgcgac ctcgcagccc 42 0 

atcaccttcg acacgttgtt cgtgcccggc accacgccac tggacagcat ctaccccatc 480 

gttcagcgcg agctggcacg tcagaccggt ttcggtgccg cgatattgcc ttcgaccggc 540 

ctcgacccgg ctcactacca gaactttgct atcaccgacg acagtctgat tt-fcctacttc 600 

25 gcccagggtg agctgctgcc gtcgtttgtc ggcgcttgcc aagcccaggt gccgcgcagc 660 

gccattccgc cgctggcaat ctaa 684 
<212> Type : DNA 
<211> Length : 684 

SequenceName : SEQ ID 698 



30 SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 

35 <400> PreSequenceString : 

atgaagatgg tgaaatcgat cgccgcaggt ctgaccgccg cggctgcaat cg-<jcgccgct 60 

gcggccggtg tgacttcgat catggctggc ggcccggtcg tataccagat gcra.gccggtc 120 

gtcttcggcg cgccactgcc gttggacccg gcatccgccc ctgacgtccc giccgccgcc 18 0 

cagttgacca gcctgctcaa cagcctcgcc gatcccaacg tgtcgtttgc ga_acaagggc 240 

40 agtctggtcg agggcggcat cgggggcacc gaggcgcgca tcgccgacca ca^agctgaag 300 

aaggccgccg agcacgggga tctgccgctg tcgttcagcg tgacgaacat ccragccggcg 3 60 

gccgccggtt cggccaccgc cgacgtttcc gtctcgggtc cgaagctctc gt cgccggtc 42 0 

acgcagaacg tcacgttcgt gaatcaaggc ggctggatgc tgtcacgcgc at cggcgatg 48 0 

gagttgctgc aggccgcagg gaactga 507 

45 <212> Type : DNA 

<211> Length : 507 

SequenceName : SEQ ID 699 
SequenceDescription : 

50 Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgacctact cgccgggtaa ccccggatac ccgcaagcgc agcccgcagg ctcctacgga 60 

55 ggcgtcacac cctcgttcgc ccacgccgat gagggtgcga gcaagctacc ga_tgtacctg 120 

aacatcgcgg tggcagtgct cggcctggct gcgtacttcg ccagcttcgg ccrcaatgttc 180 

accctcagta ccgaactcgg cggaggtgat ggcgcagtgt ccggtgacac tgjggctgccg 240 

gtcggggtgg ctctgctggc tgcgctgctt gccggggtgg ctctggtgcc ta.aggccaag 3 00 

agccatgtga cggtagttgc ggtgctcggg gtactcggcg tatttctgat gcjtctcggcg 3 60 

60 acgtttaaca agcccagcgc ctattcgacc ggttgggcat tgtgggttgt gttggctttc 420 

atcgtgttcc aggcggttgc ggcagtcctg gcgctcttgg tggagaccgg cgjctatcacc 4 80 

gcgccggcgc cgcggcccaa gttcgacccg tatggacagt acgggcggta cg-ggcagtac 54 0 

gggcagtacg gggtgcagcc gggtgggtac tacggtcagc agggtgctca gcraggccgcg 600 

ggactgcagt cgcccggccc gcagcagtct ccgcagcctc ccggatatgg gtcgcagtac 660 

65 ggcggctatt cgtccagtcc gagccaatcg ggcagtggat acactgctca gcrccccggcc 720 

cagccgccgg cgcagtccgg gtcgcaacaa tcgcaccagg gcccatccac gccacctacc 7 80 

ggctttccga gcttcagccc gccgccaccg gtcagtgccg ggacggggtc gc=aggctggt 84 0 
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tcggctccag tcaactattc aaaccccagc gggggcgagc agtcgtcgtc ccccgggggg 900 
gcgccggtct a a " ~ 912 

<212> Type : DNA 
<211> Length : 912 

SequenceName : SEQ ID 700 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<4 00> PreSequenceString : 

atgaaatgtc caggcgtctc cgactgcgtt gcgaccgtaa ggcacgataa cgtgtttgct 60 

attgctgctg gtttgcgttg gtcggccgct gtaccgccgc tacacaaagg ggacgctgtg 120 

accaaactgc tcgtcggggc catcgcgggc ggaatgctag cttgcgcagc fcatattgggc. 180 

gacggaatcg cttcggccga tactgcgttg atagtacccg gtaccgcacc gtccccgtac 240 

gggccactca ggtcgctcta tcatttcaat cccgcgatgc agcctcagat cggcgcgaat 300 

tactacaacc ccaccgctac ccgccacgtc gtttcatatc caggcagctt ttggcctgtc 360 

acaggcttga attcgcccac cgtcggcagt tctgtcagtg ccgggacgaa caatctcgat 420 

gcggcgatcc gcagcactga cggaccaatc ttcgtggccg ggttatcaca gggcacgctc 480 

gtgcttgacc gcgagcaggc acggttagcg aatgacccga cggctcctcc ccctgggcaa 540 

ctcacattca tcaaggccgg cgaccctaac aatcttcttt ggcgggcgtt taggccggga 60 0 

acccacgtgc cgatcatcga ctacaccgtt ccggccccag cggaaagcca gtacgacaca 660 

atcaatatcg tgggccagta cgacattttt tctgacccgc ctaatcgtcc gggcaaccta 720 

ctcgctgacc tcaatgcgat tgccgcgggc ggatactacg gccacagcgc caccgcattc 780 

tcggacccag ctcgcgttgc gcctagggac attacgacga caacgaacag tttgggtgcg 840 

acgaccacga cctacttcat ccggaccgat cagctacctc tggtgcgggc gctggtggac 900 

atggcgggcc tgcccccgca ggcggcggga acagttgatg ccgcactgcg gcccataatt 960 

gacagggctt atcagcccgg accagcaccc gctgtgaacc cgcgtgattt ggtccagggc 102 0 

atccgcggta tccccgccat cgcccctgcc atcgccatcc ctatcggcag caccaccggg 1080 

gccagtgccg ccaccagcac cgctgccgcc acggcagcag caacaaatgc gctccgcggg 1140 

gccaacgtgg gcccgggcgc caacaaggcg ttgtcgatgg tccggggttt gctacccaaa 1200 

gggaagaagc actag 12 i 5 
<212> Type : DNA 
<211> Length r 1215 

SequenceName : SEQ ID 701 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

gtgagcctgc taccaaccct gcagtccttc ctaccgccgc ccttcgacgc cattccgaac 60 

cccatcgagg atcttgacgt tctcgtagcc gcagctgtag ccgtcgcggc tggcagtttg 12 0 

9999tatcgg cggcgcagct cggcgagatc taccgccacg acgtggtcga cgaggcccaa 180 

aaggcgccgc attgcccagc cgaatccgac cagacacccg ctggggccgc gggtgacggg 240 

gatctccctg aggtcggagg acgggtcacc agcccgccac agccgccggt cgccgcgctc 300 

accggctact ccgctaacat cggcggactc tccgtgccgc acagctggaa tcttccgcca 360 

gcggtgcgcc aagttgcggc gatgttcccc ggcgcgactc cgatgtatat gacggggagt 420 

tcggacggct cctacgccgg cctggcagcg gcgggtttgg ccggcaccgg tctggccggt 48 0 

cttgccgccc gcggtggctc cgcgccgacc ccggctgcag ccgccccggc cggagccggc 540 

ggagccggcc cggctgccac caggcccgcc gcccagcaga cgcccgcggt ccccgcggcg 600 

gccgccgggt cagccatacc tggcctaccg cccggtttgc cgcccggcgt ggttgccaac 660 

cttgcggcga ccctggcggc gatccccgga gcgaccatca tcgtggtacc gccgtccccg 720 

aacgccaatc aatag 735 
<212> Type : DNA 
<211> Length : 735 

SequenceName : SEQ ID 702 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

atggacgtcg ctttgggggt tgcggtcacg gatcgggtcg cgcgtctggc gctggtcgac 60 

tcggctgcgc ccggcaccgt gatcgaccag ttcgtgctcg atgtggccga gcacccggtc 120 

gaggtgttaa ccgagaccgt ggtgggcacg gatcggtcat tggccggcga aaaccaccgg 180 

ctggtcgcta cccggctgtg ttggccggat caggccaaag ctgacgagct gcagcacgca 240 
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10 



15 



20 



25 



30 



35 



40 



45 



ctgcaggact ccggggtcca cgacgttgcc 
ctggtcgggg cggcacatgc cggctctgcc 
ttatcggtgg ttggtgaccc ggacgcgccg 
ggcgccgacg ccacatcgac cgtcgatacc 
gccccggggg atgtcttcct ggtgggtagg 
cagctgcgcg cggcgtcgac gatgcgcgtg 
gcccgtggcg cggcgatggc ggccggcgcc 
gatgcgacca cttcgctccc ccgggccgag 
ctggcgtact cgcaggccag cgattacgag 
gacgaatacg gggcagccgc ggatcgctcg 
ggcaacgctg tcgtggcctt tgcggtgatc 
gtcaccatcc gaccgaccgc ggcctcaaaa 
gggaagttca tgccgttgtt gccgacgcaa 
gatgatqcca ccgctggatt ccagggcggc 
. cggccgggta cctcacccgg ggtgggtggg 
gccgtgcccg gtgttgtgcc tgccccggtg 
ttcccgggtt ggcagcctgg aatgccgacc 
accacgtcgg cgacgacgcc gccgaccacg 
acgacgccgc cgaccacgcc ggtgaccacg 
accacgccac caacgaccgt cgccccgacg 
accgtcgccc cgaccacggfc cgctccagcc 
ccgacgcagc agcccacgca acaaccaacc 
gccccgcaga cggtggcgcc ggctccgcag 
gggggcgact tattcggcgg gttctga 
<212> Type : DNA 
<211> Length : 1647 

SequenceName : SEQ ID 703 
SequenceDescription : 

Sequence 



gtgatatccg 
gtgctgttgg 
ccgacgatgg 
ctgatggccc 
tccgccgagc 
cagactcccg 
gctacgatgg 
gcggggcaat 
ctgcttccgg 
gcgccgttga 
ggtttcgcct 
ccggtagagg 
cagcaggcgc 
accattccgg 
acgccggctt 
ccaatcccgg 
atccccaccg 
ccgccgacca 
ccgccaacga 
accgtcgccc 
accgccacgc 
caacagatgc 
ccgccgtccg 



aggcgcaggc 
tgggtgatga 
tggccgtcgc 
ggctcggcga 
acaccacggt 
acgaccccac 
cgcacccggc 
cgggttctga 
tcgacgaata 
gccgacggtc 
cgctggcggt 
gacaccaaaa 
cggtcccgcc 
ctgtacagaa 
cgcctgcgcc 
tcccgatcat 
caccgccgac 
cgccggtgac 
cgccgccgac 
cgacgacggt 
cgacgaccgt 
caacccagca 
gtggccgcaa 



cgccacggcg 
gacggcaacc 
gccggtggcg 
ccaggccctc 
tcttgccgac 
gttcgcgctg 
cctggtcgcg 
aggcgagcag 
tgaggaacac 
gctgctgatc 
ggcggtggcg 
cgcccagcca 
gcctccgccc 
cgtggtgccg 
ggaagcgccg 
cattcccccg 
gacgccggtg 
cacgccgcca 
cacgccggtg 
cgctccgacc 
cgctccgcag 
gcagaccgtg 
cggcagcggc 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

ttgaagaacg cccgtacgac gctcatcgcc gccgcgattg ccgggacgtt 
tcaccagccg gtatcgccaa tgccgacgac gcgggcttgg acccaaacgc 
ccggatgccg tgggctttga cccgaacctg ccgccggccc cggacgctgc 
actccgccgg ctccggagga cgcgggcttt gatcccaacc tccccccgcc 
gacttcctgt ccccgcctgc ggaggaagcg cctcccgtgc ccgtggccta 
tgggacgcga tcgcgcagtg cgagtccggt ggaaactggt cgatcaacac 
tactacggcg gcctgcggtt caccgccggc acctggcgtg ccaacggtgg 
gcggccaacg cgagccggga ggagcagatc cgggtggctg agaacgtgct 
ggtatccgcg cctggccggt ctgcggccgc cgcggctga 
<212> Type : DNA 
<211> Length : 519 

SequenceName : SEQ ID 704 

SequenceDescription : 



ggtgaccacg 
cgcagccggc 
acccgtcgat 
gctggccccg 
cagcgtgaac 
cggtaacggt 
ctcggggtcc 
gcgttcgcag 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1647 



60 
120 
180 
240 
300 
360 
420 
480 
519 



Sequence 

50 <213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

atgacgcggc tgataccggg ttgcacgctc gtcgggctga tgctgacgtt actgcccgcg 60 

cccacctcgg cggccgggag caacaccgcc accaccctgt tcccggtcga cgaggtcacc 120 

cagctggaga cgcacacctt cctcgattgc caccccaacg gcagctgcga cttcgtcgct 180 

55 ggagcaaatc tgcgcacacc cgacggcccg acgggctttc cgcccgggct gtgggcgcgc 240 

caaaccaccg agatccgttc gacgaaccgg ttggcctatc tggacgcgca cgccaccagc 3 00 

cagttcgaac gggtaatgaa ggcgggcgga tccgacgtga tcaccaccgt ctacttcggc 360 

gagggtccgc cggacaaata ccagaccacc ggggtcatcg actcgaccaa ttggtcgacc 420 

ggtcaaccga tgaccgacgt caacgtcatc gtgtgtacac acatgcaggt ggtctacccg 48 0 

60 ggggtcaacc tcacctcgcc cagcacctgc gcgcaagcca acttttccta g 531 



65 



<212> Type : DNA 

<211> Length : 531 

SequenceName : SEQ ID 705 
SequenceDescription : 



Sequence 
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<213> OrganistnName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgacaccgg gtttgcttac tactgcgggt gctggccgac cacgtgacag gtgcgccagg 60 
5 atcgtatgca cggtgttcat cgaaaccgcc gttgtcgcga ccatgtttgt cgcgttgttg 120 
ggtctgtcca ccatcagctc gaaagccgac gacatcgatt gggacgccat cgcgcaatgc 18 0 

gaatccggcg gcaattgggc ggccaacacc ggtaacgggt tatacggtgg tctgcagatc 240 
agccaggcga cgtgggattc caacggtggt gtcgggtcgc cggcggccgc gagtccccag 3 00 

caacagatcg aggtcgcaga caacattatg aaaacccaag gcccgggtgc gtggccgaaa 3 60 

10 tgtagttctt gtagtcaggg agacgcaccg ctgggctcgc tcacccacat cctgacgttc 420 
ctcgcggccg agactggagg ttgttcgggg agcagggacg attga 465 
<212> Type : DNA 
<211> Length : 465 

,Sequenc6Name :. .SEQ ID .706 

15 SequenceDe script ion : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 

20 <400> PreSequenceString : 

atgatgcaac aagcggtgtc gggcattacc ggcgcgctcg gcggcgcggt cggcggcgtc 60 

atgggcccac tcacgcagct tccccagcag gccatgcaag cggggcaggg agcaatgcag 120 

ccgctgatga gtgcgcttca acagacctat ggcgcggagg gactggacgt cgcggacggg 180 

gcgcggctgg tggacagcat cgaaggtgag cccggcctcg gcggcgagcc gggcgctggt 240 

25 gacgtcggcg ccggcggcgg gggtggtggc accaccccga cgggctatct gggtccccca 3 00 

cccgtgccga cgtcgtcgcc accgacgact ccagccgggg cgccggccaa gtcggtgacg 3 60 

ccggacccgg ttagtggcac cccgcgggcg tcggggccgg ccggcatgac cggcatgccg 420 

atggtgccgc cgggcgcgtt gggtgcgggc gcggaaggag ccaataagga caagccggtc 480 

gagaagcggg tgacgggctg tgccgaatgg tcaaccggtc aagggccgct taacagtacc 540 

30 gccgagtgtt ccggtgaaat ctgccgacga caagccggtg gtcaccaagt cgacgcgacg 600 

gatccttgtt gtgccgaacg acgacaaggt taa 633 
<212> Type z DNA 
<211> Length : 633 

SequenceName : SEQ ID 707 

35 SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H37Rv 

40 <400> PreSequenceString z 

atgattcgcg aactggtcac caccgctgcg atcacgggtg ccgcgatcgg tggggcgcca 60 
gtcgcgggcg cagacccgca gcgttatgac ggcgatgtgc cggggatgaa ctatgacgct 120 
tcgctgggcg ccccatgctc cagctgggag cgcttcattt ttggacgagg cccctccggt 180 
caggccgaag cctgtcattt tccgcctcct aaccagttcc cgccggccga aaccggctac 240 

45 tgggtgatct cctacccgct atacggcgtc cagcaggtcg gtgcgccgtg tccgaagccg 3 00 

caggcggccg cgcagtctcc ggatgggttg ccgatgctgt gtctgggagc ccgtggatgg 360 
cagccgggat ggtttaccgg ggccgggttc ttccctccgg agccataa 408 
<212> Type : DNA 
<211> Length : 408 

50 SequenceName : SEQ ID 708 

SequenceDescription : 

Sequence 



55 <213> OrganismName : Mycobacterium tuberculosis H37Rv 
<400> PreSequenceString : 

atgaaaacca caggcacaac tatcaaactc ggcatcgtct ggttggtgct gtcggtgttc 60 

accgtgatga tcatcgtggt gttcgggcag gtgcggttcc atcacaccac cgggtactcc 120 

gcggtgttca cccatgtcag cgggctgcgg gccgggcaat ttgtccgcgc tgcgggcgta 180 

60 gaggtcggca aggtcgccaa ggtaacgctg atcgacgggg acaagcaagt attggtggac 240 

ttcaccgtgg atcgctcgct gtcactggat caggcgacga ccgcctcgat ccgctacctc 300 

aacctgatcg gcgaccggta ccttgagctc ggccgcggtc acagcggtca gcggctggcg 360 

ccgggtgcca cgatcccgct cgagcacacc catccggcct tggatctcga cgctctgctc 420 

ggcgggtttc gcccactctt ccaaacgttg gacccagaca aggtcaacag catcgcctcc 480 

65 tcgatcatca ccgtgttcca agggcaaggc gccaccatca acgacatcct cgaccagacc 540 

gcctcgctga cggcaacgct ggccgaccgg gaccatgcga taggtgaggt cgtcaacaac 600 

ttgaacaccg tgctggccac caccgtcaag catcaaacgg aattcgaccg cacggtcgac 660 
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aagctagagg tgctgatcac tggactgaag aacagggcgg acccgctggc cgcggcggcg 72 0 

gcacacatca gcagcgccgc gggaacccta gccgacctgc tggggcggat cgtccattgc 78 0 

tgcacagcag cttcgggcac ctcgagggca tccagcagcc gctcatag 828 
<212> Type : DNA 
<211> Length : 828 

SequenceName : SEQ ID 709 

SequenceDe script ion : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

atgactccac gcagccttgt tcgcatcgtt ggtgtcgtgg ttgcgacgac cttggcgctg 60 

gtgagcgcac„ccgccggcgg .tcgtgccgcg catgcggatc cgtgttcgga catcgcggtc X20 

gttttcgctc gcggcacgca tcaggcttct ggtcttggcg acgtcggtga ggcgttcgtc 180 

gactcgctta cctcgcaagt tggcgggcgg tcgattgggg tctacgcggt gaactaccca 240 

gcaagcgacg actaccgcgc gagcgcgtca aacggttccg atgatgcgag cgcccacatc 3 00 

cagcgcaccg tcgccagctg cccgaacacc aggattgtgc ttggtggcta ttcgcagggt 3 60 

gcgacggtca tcgatttgtc cacctcggcg atgccgcccg cggtggcaga tcatgtcgcc 420 

gctgtcgccc ttttcggcga gccatccagt ggtttctcca gcatgttgtg gggcggcggg 480 

tcgttgccga caatcggtcc gctgtatagc tctaagacca taaacttgtg tgctcccgac 540 

gatccaatat gcaccggagg cggcaatatt atggcgcatg tttcgtatgt tcagtcgggg 600 

atgacaagcc aggcggcgac attcgcggcg aacaggctcg atcacgccgg atga 654 

<212> Type : DNA 
<211> Length : 654 

SequenceName : SEQ ID 710 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

gtgataagca ccacaagaat tgatttccta tggatattgt cggtagcgtt cgcgtccatg 60 

attgctcttg caacgctgtt gacgcttatc aatcaagtcg tcggcactcc gtatattccc 120 

ggtggcgatt ctcccgccgg gaccgactgc tcggagctgg cttcgtgggt atcgaatgcg 180 

gcgacggcca ggccggtttt cggagatagg ttcaacaccg gcaacgagga agccgccttg 240 

gcggctcggg gctttcaaca gggaaccgcc cccaatgcct tggtgatcgg ttggaatggc 3 00. 

caccacacgg cggtgacgct gcccgatggc acgcccgtat ccagtggtga aggcggtggc 360 

gtgcgggtcg gtggcggtgg cgcctaccag cccaaattca cccaccacat gtatctgccg 420 

atggatgtgg acgcgggaga agaccagccg ccggcgccag atgagccggt caccgcggtc 48 0 

gacgacgtgg aaccggaaat gcctgcaccg tgcccgaccc agcgcccgcc ggtgaccccg 540 

agacataacc tgtgcaacaa actccggact atgccagggg cgctctcggc cgcgctggcc 600 

gcggcggcgc cggtctggcc ggcccctata agcggctgcc gcgggttcag cacgtccctc 660 

ttagcaaaaa gaaatcaccc agtaatcgtc gggaaatag 699 
<212> Type : DNA 
<211> Length : 699 

SequenceName : SEQ ID 711 

SequenceDescription : 

Sequence 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

atgacgacga tgattactct tcggcgacgg ttcgcggtgg ccgtcgccgg cgtcgccact 60 
gccgccgcga cgaccgtcac cctggctccc gcaccagcaa atgccgccga tgtctatggc 120 
gcaattgcct actccggcaa cggctcgtgg ggccgatcgt gggactaccc aacccgggcg 18 0 

gctgccgaag ccaccgccgt caagtcgtgt ggctactccg actgcaaggt gctcaccagt 240 
ttcaccgcct gcggcgccgt cgccgccaac gatagggcat accagggagg agttggaccc 3 00 

accttggccg ccgccatgaa ggacgccctg accaagctcg gcggcggcta catcgacacc 360 
tgggcctgca actaa 375 
<212> Type : DNA 
<211> Length : 375 

SequenceName : SEQ ID 712 

SequenceDescription : 



Sequence 
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15 



<213> OrganismName : Mycobacterium tuberculosis H3 7Rv 
<400> PreSequenceString : 

atggccggac tgaacattta cgtgaggcgc tggcggacag cgcttcacgc 
gcattgatag ttgccatcct cggactcgcc atcaccccgg tcgctagtgc 
agggcgacgt tgtcggtgac atcgacgtgg cagaccggtt tcatcgcccg 
acaaactcga gcacggcgcc gctaaccgat tggaagcttg aattcgactt 
gaatccgtct tgcacacatg gaatagcacc gttgcacgat ctggcacgca 
agcccagcga attggaatcg catcattgcc cccggtggtt cagccacggg 
99 c gggctga ccggttctta ctcgccgccg tcgagttgtc tgctcaacgg 
tgcacctag 
<212> Type : DNA 
<211> Length : 429 

SequenceName s SEQ ID 713 

SequenceDescription : 



aaccgtgtcg 
ggcgacggcc 
cttcaccatc 
gccggcagga 
ctacgttctc 
cggcctaaga 
gcaatatcct 



60 
120 
180 
240 
300 
360 
420 
429 



Sequence 



20 



25 



30 



35 



40 



45 



<213> OrganismName ; Mycobacterium tuberculosis H37Rv 
<4 00> PreSequenceString : 

atgctgactc gcgctatcaa gacccagctg gtgttgttga cggtgttggc 
gtggtggtcc ttggttggta tttcctgcgg 
acgctttatg ccgaattgcc tcggtccggg 
cggggcatca ccatagggaa ggtcaccggc 
accatgagca tcgacaatgg ctaccagatc 
gtgtcggcgg tcggcgagca gttcgttgac 
ctgcggcatg ggcagacgat caccacgact 
gacgccgcca accgtggatt ggcagtgctg 
gaggcgtcgg aggccgtggg cgggctggga 
caggcaatcg cccacgatgt caggggcagc 
tcggcgccta tcatcgatag ccaggtcaat 
aacctcaaca cgctggccgc tcagaccgcg cagaccgatc 
gccaacgcgg caccgactgc cgatcaggtc aacgccacgt 
ttgccgcaga cgctggccaa tctcgaggtc gtaatcgata 
ggcgtcgagc aggcgttggt gttcttgccg cagtccggcg cgatcgccca 
acagagttcc ccggccaggc cggactgggt gtcggcggcc tggcgctcaa 
ccgtgcctga ccggcttcct gccggcgtcg gagtggcggt cacctgctga 
gcaccgctac ccaagggcac ctactgcagg attccgatgg acgcgagcaa 
ggagcacgca acaacccgtg tgtagacgtg cccggcaagc gggcggcgac 
tgccgcagca atgaagctta tgtgcccggg ggcaccaatc cctggtatgg 

gccggccgcg cgttgtgacc agccggtgaa 
taacaatggc atcaacccgc tgcccgccga 
ggcacacctc caccggtcaa cgatcctttg cagcgacctg ggtcaggcac 
aatgggcaac aacccaaccc gtgcgtctac accccgagca catttcctac 
gacgtgcaga gcggcaaagt cgtagcaccc gacggtgtgg tgtattccgt 
actcatgccg gagccgacgg atggaaggtg atgctggcac caaccggctg 



atacccagcc tggtcggcat 
ggtctatacc gaacagccaa 
gtcgaaccaa ccgagcgggg 
cccaccgacg cctcggccaa 
ctggtgtcga cccgcaccag 
acggtcccca gccagattgg 
cccaaagacc gggtcgcgtc 
tcctcactga atcgcctcat 
ctcgaggaca tcgacgacat 
tccggcaacg agatcgcccg 
cggcggtgcg 
tcagcgacgt 
tgctcaagcg 



cagatgctca gctgtcccgc 
gtgatcccgg cgccgtcagt 



ggtcatcgcg 
cggtcgatac 
cgtcacatat 
cgcgcgagca 
tgtgcactca 
cggtccgtat 
cccggcgctg 
ggtgctgcac 
cgaagccacc 
catcgagcgt 
ctgggccgcc 
aagcattctg 

gcgggagtcg 

ctaccacaac 
gtcggttact 
ccaaccaccg 
caccagcacc 
tgtggttcgt 
cccgcgggaa 
ggaccccaac 
gccaggccag 
tcagctgcca 
cgtccagtgc 
aaccatttac 
tgaggcttcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1551 



<212> Type : DNA 
<211> Length : 1551 
50 SequenceName : SEQ ID 714 

SequenceDescription : 

Sequence 



60 



65 



<213> OrganismName : Rickettsia prowazekii strain Madrid E 
<400> PreSequenceString : 



atgttaaata 
atgagttcta 
aaagccatga 
tttaagaaaa 
gaagtaataa 
tctatttatg 
tcaaataaaa 
aataatatac 



atacacaatt 
taaaaaatac 
atattttttt 
attccgagat 
attctaaaga 
aaacatctat 
tatttgaagc 
ataatactgc 



tttaaatttg 
cactaatcta 
taccactaac 
tatacaaaat 
ttttaaacaa 
ggacaatgct 
cgcaaataaa 
agaacaagta 



atgaaatcct 
gatctttcat 
aaaatttcca 
aatattaata 
gctactgaat 
aaggaattag 
catattacca 
caaaaaaact 



<212> Type : DNA 
<211> Length : 480 



atatgaaacc 
ctatcaccaa 
cagaaagtat 
ctattttaaa 
atcatcaaaa 
caaatattgc 
agaatattca 
ttaataacaa 



agaattttat 
tacaattcaa 
gcaatctttg 
tagtactaaa 
atgtgtaaaa 
ttatgaagct 
taatgcttct 
atctgcttaa 



60 
120 
180 
240 
300 
360 
420 
480 
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SequenceName : SEQ ID 715 
SequenceDescription : 

Sequence 
5 

<2 13 > OrganismName : Rickettsia prowazekii strain Madrid E 
<400> PreSequenceString : 

atgaatatta aattagttac atatttttta atattagtaa gctcattaaa agtaaatgct 60 
gatttaaatc atattcaaga tagttttaaa tatcaagaag cagagcagtt aacaatagaa 120 

10 ttaccttgga atgactgtac tgcaattcat aaattcttag aagaaaagtt atttttttca 18 0 

gaacaacaaa taaaaaaaga aaataaaatt catgagaaat ataagcaatt ttatttacaa 240 
cataataata agctttctga tttttctatg caatttctag aaaaaaaatc tgaaattaat 3 00 

agtgtcgaaa ctttaatatc aggcttttta aaattttgtg aagataattt tcaaacaagt 3 60 

asaagtaaat cgcattcttt aaattttttc . caaaaacaac aagaccaatg gttacataat 420 

15 ataagaaatg agaattataa aacatattat aagaagaaat atgaagacaa tacctttaga 480 
aatattaatt aa 492 
<212> Type : DNA 
<211> Length : 492 

SequenceName : SEQ ID 716 

20 SequenceDescription : 

Sequence 



<213> OrganismName : Rickettsia prowazekii strain Madrid E 
25 <4 00> PreSequenceString : 

atgaaaaagt tacttttaat tgctactgca 

tttgcagagt gcattgataa tgaatggtat 

aataaagaac aagataaggc aacaggtgtt 

gatttgggta ttggttatta tatttctgaa 
30 acaataggtg gaaaactcaa gaaatacgga 

aacgtttcag tgagccataa gcctactgtt 

ttaacaagtt ttgatatgtt tgatgttttc 

aaagagaaaa ttagtggggt aagcggtctt 

tcatataagc tgattttcgg tacttctgcg 
35 gcatatagct ggataaatga tggtaaaaca 

agtgtgcaaa ccggtggtat gcgttatcaa 

ggtatataa 

<212> Type : DNA 

<211> Length : 669 
40 SequenceName : SEQ ID 717 

SequenceDescription : 

Sequence 



45 <213> OrganismName : Rickettsia prowazekii strain Madrid E 
<400> PreSequenceString : 

atgaagaaga atatgagaaa gcaaatgctt aaaatcatat caatcattat tatttctctt 

ttattaagca gttgctccga atctacgcgt gatgaaaatg gattacttac agatagtcaa 

agtactataa ttcgagatta tataatatcg caaaattcta aaaatcttaa agtgaacctt 

50 aaagaaaagt ttggttccaa tttaaaagga gtaaaattaa taggaataaa gttaacaaat 

gaagatttat cgggaataga tttcacttca tgcgaaatat tacggactga cttcatgggt 

agcaacttag aaaaagcaat acttacaaat tcggtaattc aagaaagtaa ttttgcggat 

tcagtaataa aaaatatttc aggctataat gctgattttc aaggttcaat ttttaataat 

ataacattac aaaatacaaa ttttgttcaa tcaaatttca gtgatactgc ttttaataaa 

55 agtactataa tcaatgtcaa ttttgaaaat tctaaattta gtaatgtatt atggtgtcac 

agtaatattg acagtagtaa ttttcaaaaa actcatctaa aaaataatag ctttaaaaat 

actaatgtaa tgaattcaat attttatggt gcagatttag gcaaaagtgt aataaataat 

acaaatttta ctaataatta ttttgaatct agtgacctaa gtaacactaa attcacatca 

gtaatcatta aagattctaa cttcacacaa agtattttta attcagtaaa tttcaataat 

60 atacaaagta ataactcttt tttttcatat acttcctttg aagattcaac attacacaat 

attcacctta ctaaatgtga tttacaaaac agcacaatta atagttcagt tttcaataat 

tttaaaatcg acaatgctat attaacaaat atgagtctca acgataatac atttaataat 

ttatcaataa aaaatagtaa tactaatttt gtaaggatta ataaatccaa agggtttaat 

attactttac tcaatactaa ctatagtaat gctattttta gcaataatga tttaaaagaa 

65 tttaaagtca ttaatactga tttaaacaac agtgaaataa taaactcaaa tttcactaat 

ggacaattta ataatgtaaa tttttctcaa tctttaatac aaaacgtaaa ttttacagac 

gtgaaaatta ctttaggcaa tttaaatcaa gtagctctaa taaattccaa tctaataaac 



agtgcaacaa 


ttttatcttc 


tagtgtatcg 


60 


ttaagagcag 


atgcaggtgt 


agcaatgttt 


120 


aaattaaaat 


ctaataaggc 


tattccaatt 


180 


aatgtacgtg 


ctgatttaac 


tttaggaact 


240 


gcagcaacta 


atacacattt 


tactggtact 


300 


acacgtttgc 


ttattaacgg 


ttatgtagac 


360 


gttggtggtg 


gtgtcggtcc 


tgcattagtg 


420 


gcatctaaca 


ctaaaaataa 


aaccaatgta 


480 


caaattgcag 


atggtgttaa 


agtagagcta 


540 


aaaactcata 


atgtaatgta 


caaaggggca 


600 


agtcataacc 


tgacagtagg 


tgtaagattt 


660 








669 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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actaatatta ttaactcagt cctttctaat tcacaaataa ataatattaa ctaccaagca 13 2 0 

tattatagtt ttatcaatac taatgtttct aataatattg ttataaatga taattcgaat 13 80 

caaattccac caaataatat agtaatcaat tctgaaaaag atttacaaaa catatctaat 1440 

ttagcaaata tgaatttaac aaattttaac ttaagtaatt tagtgtttaa tggagtagat 15 00 

5 ttttcaaaaa gcatctttaa aaaagctaat ttaacaaata cagtaataaa aaattctatt 1560 

ttaaaagatg ctaatttttc tgcagcaata cttactaaaa cagatttctc aaaatcgata 1620 

ttaacaggta gtatatttaa gtttgctcaa attgatcaga catgttttag taattccgac 1680 

ttaacaaata ctgattttac tgaagcaaca attaaaaata ctgcatttga taatgctaat 1740 

acacacggta taaaaggatt agaataa 1767 
10 <212> Type : DMA 

<211> Length : 1767 

SequenceName : SEQ ID 718 
SequenceDescription : 

15 Sequence 



<213> OrganismName : Porphyromonas gingival! s W83 
<4 0 0> PreSequenceString : 

gtgatacaaa aatttactaa tgtaaaacta aatgatatgc gaaaaatttt gagctttttg 60 
20 atgatgtgct ctctgcattt aggtctacaa tctcagactt ggcatggaga tccggactca 12 0 

gtggcagccc taccttctat cggtattcaa gagtcaagtt gtacccgaat cacgttcgag 180 
gttgttttcc ccggatttta tagtgtggaa aaacgagaag gcaaccaagt ctttcagcgc 240 
atttccatgc cgggttgtgg ctcgtttggg aatctgggcg aagctgaatt gcctgttttg 300 
aaaaagatga tagccgttcc ggaattttca acagctaacg ttgctgtaaa aatcaaagag 3 60 

25 acggagacat tcgacaatta taatatctat cctaatccta cctatgtcgt agaggagttg 420 
cctgaggggg ggacttatct ggtagaggct ttcgcgataa acaatgacta ttatagccaa 480 
aatgtaagcc tcccttctac tcactatgtc tattctcaag acgggtattt tcgctcacaa 540 
agatttatcg aagttaccct gtatcctttt cgatacaacc ctgtccgaca agaaattcta 600 
tttgcaaaaa aaatcgaggt tacaataact ttcgataatc ctcagccacc tttacaaaaa 660 

30 aacaccggca tatttaacaa agtagcctcc tctgcattta ttaattatga agctgatggc 72 0 

aaatcggcga tagaaaatga tatggtgttc agtcgtggta caacaacgta cataagcgga 780 
aatgttgcca gcaacctccc tcagaactgt gactacttgg ttatttacga tgatatgttc 840 
.aacgtaaatc aacaaccaca cgacgaaatc aaacggctgt gcgaacatag agccttctac 90 0 

aacggctttg atgtagctgc tgtaagtata aaggacgtat tgaatagctt cccatcaaat 960 

35 gccacctcat acatcaacga aactaaactg aaaaatttca ttcgctcagt ttacaaccaa 102 0 

agcaatgcga agaggacttt agatggcaaa ctgggatacg tgctactgat cggaaaacca 1080 

ttgagcaaat atttggctga cactgataat acaaaagtcc caacctcttt tattcataat 1140 

gtctccttaa ttccaagtca tccaactttt ggttccatat gcgcctccga ctattttttt 12 00 

agttgtgttt cgccccttga tactgtcggc gatttgttta tcggtcgatt tagcgtcacc 1260 

40 aatgctcatg aattgcacaa tctgattgaa aagactatca acaaagaaat ctcatataat 1320 

cctattgcac acaaaaatat tctttacgca gaagggaaag gctgcgatgc tccaatctta 13 8 0 

cgtttattct taaaagaaat cgcctctggt tacacagtca actctatctt aaaatctaat 1440 

caggtctctg caatagactc gatatttgac tgcttgaata atggttccca tcatttttat 15 00 

tttaacactc atggaatgcc gactgtttgg gggatagggc agggactcga cgtcaatact 1560 

45 ctaacagccc gattgaacaa tacatcttcg cagggattat gtacgagtct atcatgtagt 1620 

tcggctgtag cagattcaac tattagatcg cttggagaag tcctgaccac atacgcacct 1680 

aacaagggat tctcggcttt cttaggagga agcagagcca cccaatatgc cgtttattta 1740 

gaaggcccct gtcctccgtc agaattttat gaatatttac cttattcttt atatcacaat 1800 

ctctcgactg ttgttggcga aatgttgcta tcatccatta tcaatactaa ttctgttgat 18 60 

50 acgtattcga aattcaactt caatttgctt ggcgaccctg cactaaacat tatggctcat 1920 

ggcatggagg ttagtaattg tattacacta ccaaacaaca ccattataag cagtccgata 1980 

acaataaaaa atggtggctg cctaaaaata ccggaaaaag gagttttgca ttttactaat 2040 

aatggctcca tacaagtcat gtccggagga actctggaaa taggcaatca ggctaaaata 2100 

tccggagaga ccggtgctaa ccccaccttt attaccgttt acggcgatgg tcttgcgatt 2160 

55 aacaagcagg tagagataga caatatagac cgacttaact tgttttctac gcattcggtc 2220 

atgcccaaat ttcattttga cagtgtgaaa ttcaacagtg ccccgctgta tacaacgaac 22 80 

tgtattgtgg agataagcaa ttgcgaattt accaatcgaa gtgacattat ttcaaagaat 2340 

tgtgacctaa gcgttgaaaa cagtatgttt agcagttcgg ggataacggt attcaagcct 2400 

atggctacaa gctccatcac cggattatct acaaaagcaa agattaccga caatactttt 2460 

60 tttgcgacag gaaacttcgc ctaccatatc acaaacacgc caggcttaac agcaacctcc 2520 

aatgctgcca tcaagttaga caatattcct gagtattaca tttccggtaa taaaatagtc 2580 

aattgcgatg aggctcttgt actaaataat agtggcaaca gaacgaacag actccacaat 2640 

atcacacgga atgtgataaa aaactgtagg attgggagca cgctttataa ttcctatggt 2700 

atttacaacc gaaataagat cagtaacaat catataggag tacgtctcct caacaacagt 2760 

65 tgtttttatt tcgataatgc tcctgtaatc aatgaagaag ataagcagac gtttatttct 2820 

aataggactt ggcagctcta ttcatcaaac ggtacattcc ctctcaactt ccattacaac 28 80 

agcttgcagg ggggagatac agatacatgg atttacaacg acacgtatac gaatcgctat 2940 



WO 2005/076010 



PCT/IN2005/000037 



320/341 



attgacgttt caaataatca ctggggcaac aatgatttgt ttgatccgaa tcaggttttc 3 0 00 

aatacgccag acttgttcat ttggatacct ttttgggatg gattgccaaa tgggagatcg 3060 

ggcaatagct ctgctgaagc agtagaattc caaacagcat tggactgtat tggcaatagc 3120 

gattatcttt cggcaaaagt ggctctcaag atgatggttg aaacctaccc ggaatccgac 318 0 

5 tttgcaatag ctgctttgaa ggaattgttc aggatagaga aaatgtcagg caacgattac 324 0 

gaaggcttga aagattattt cagatccaat ccaaccatca tctcttccca gaacttgttc 33 00 

ccgacagctg atttcctgtc tgcgcgatgc gatattgtgt gtgaaaacta tcagtctgcc 3360 

atcgattggt acgaaaatcg cttgaatagt gaaatctcct atcaggacag tgtttttgca 3420 

gtcattgacc ttggtgacat ttattggaat atgcagttag actcactcag agggactggt 3480 

10 atagatttga acatactttc ctgtgaacaa aggaaatcgc tcgaaagcca tcaaaatgta 3540 

aaaaattatt tgttgtcaac tcttcccgaa tcaacaggta ctctcctgcc tccattagaa 36&0 

tgcaacaaat caagccttga taaatccaag ataatctcta tttcgcccaa tccggcgaaa 3660 

gctgttgtaa caataatcta ctataccgat aacccttcct gttctgtaat aaaaatatat 3720 
ggaataaatg gagcctcggc tgatataacc gggttgccca aaqatctatc cgaaggttat ,,.37-PO 

15 tacagcatac agttcaatac atccaacttt gatcccggtt tctacctggt aacgctaaat 3840 

gttgatcaga aaattataga tacggaaaaa ttacgaatca aataa 38 85 
<212> Type : DNA 
<211> Length : 3885 

SequenceName : SEQ ID 719 

20 SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

25 <400> PreSequenceString : 

ttgcagggta aaaacacgat tgtcacaacg ggggattact caattggttt gctaagtcaa 60 

acgagtggaa acctgaatac tgacaccata ataagagtca actctgacgg ttcggttacc 120 

ccctcttttt ctgacggaga tgatacattt attgttactg cgggtaatca cgcagtcggc 180 

gttcttgcgt gtgcatctcc cggaagcgcg tgtgcgtgtg tatcttctct tgatgaagaa 240 

30 agtactgccg atacaggaag taatgaaaat aatgccatag caaaactgga tatggcaaaa 3 00 

ggggagatca caacccacgg aacagaaagt tatgccgcgt acgctaatgg cactgtcgtc 360 

aaggctggtg acacgcttga ttataccaat gccagcgtta cattaaccga tgttgatatc 420 

■ accacgcatg gggacaatgc tcatgccatc gctgcccgcc aggggacagt ttcatttaac 480 

cagggagaaa tttatacaac gggtcctgac gccgcgattg ctaaaattta taatggcggc 540 

35 acggtgacgc tgaaaaatac atctgcagtc gcgcatcagg ggtctggaat tgtgctggag 600 

tcttccataa atggtcagga agcaacggta gatattttat ctggtagctc actacggtca 660 

gcaaatgaaa tcctctacca taaagatgag acgagtaacg tgaccattac ggatagtgag 720 

gtatcatcgg ctgcagatgt ttttatcaat aatattaaag gtcatttgac cgtcgatgca 780 

actaattcaa aaataacggg ttcagctaat atttcaacag atgataacac tcatacctat 840 

40 ctgtcgcttt cagataatag tacctgggat attaaagctg actcaacggt gagcaacctc 900 

acggttgata atagtacggt ctatatttcc cgggcagatg gacgggatgt cgaaccaaca 960 

agattaacaa taactgaaaa ttacgttggt aataatggcg tattgcatct cagaactgaa 1020 

ttggatgatg ataattcagc tacggataaa gtcgttatta atggaaatac ctctggaaca 10 80 

acccgagtta aagttaccaa tgcaggaggc agcggagctt acacgttaaa tgggatagag 1140 

45 attatcagcg ttgaggggga atcaaatggg gaatttatta aggattcgag gattttcgcc 1200 

ggtgcctacg aatattcatt aacccgaggt aataccgaag cgaccaataa aaactggtat 12 60 

ctgactaact tccaggcaac gagcggcggt gaaacaaact ccggaggaag ttcagcgcct 13 20 

actgttgcgc ctacccccgt cctgcgcccc gaagctggaa gttacgtcgc caacctggca 13 80 

gccgctaaca ctctttttgt tatgcgtctg aacgaccgtg cgggtgaaac gcgctacatc 1440 

50 gatcctgtaa ctgaacagga gcgttcaagc cgactttggc tacgtcaaat tggcgggcat 1500 

aatgcctggc gtgacagcaa cggacagttg agaacgacct cgcatcgcta cgtctcgcag 1560 

ttagggggcg atctgttaac cggtggtttt accgatagtg acagttggcg tttgggagtg 1620 

atggctggtt atgcccgcga ctacaactta actcattcca gcgtgtcgga ttatcgttcg 1680 

aaagggagtg tcagaggcta tagcgcaggg ctgtatgcca cttggtttgc cgatgacatc 1740 

55 agtaaaaaag gcgcatacat tgactcctgg gcgcaatata gctggtttaa aaactccgtg 18 00 

aaaggggatg aattagccta tgaatcctat agcgcgaaag gtgcaaccgt ctcgctggaa 18 60 

gcgggttacg gttttgccct gaataaatcc tttggtctgg aagcggcgaa atatacgtgg 1920 

atcttccagc cacaggcaca ggctatctgg atgggcgtcg atcataatgc gcacacggaa 1980 

gccaatggct cacgtattga gaatgacgca aataacaaca tccagacccg actcggcttc 2 040 

60 cgcaccttta ttcgtactca ggagaaaaac agcggtccgc acggtgacga ctttgaacct 2100 

tttgttgaaa tgaactggat ccataacagt aaagattttg ctgtctcaat gaacggtgtg 2160 

aaagtcgaac aagatggggt gagtaatttg ggggaaatta aacttggcgt aaatggcaac 2220 

ctgaatccag cggccagcgt ctgggggaat gtgggcgtgc agctgggtga taatggctac 2280 

aatgacaccg cagtgatggt gggcctgaaa tataagttct ga 2322 

65 <212> Type : DNA 

<211> Length : 2322 

SequenceName : SEQ ID 720 
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SequenceDescription 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Sequence 

<213> OrganismName : Shigella flexneri 2a str. 
<400> PreSequenceString : 



2457T 



gtgaccaaac tcaaacttct 
cacgctgaag gtaaattttc 
gattacgata ccgatgttta 



ttccgtggct 
accgcttact 
cacctggatg 
tacggttacc 
tgggatatgg 
ggtgtgcagt 
tccgctcgca 
ctgagcgcca 
cgtctgtctg 
tctaccggga 
<212> Type 
<211> 



taggtggtgg 
ggtcgccgct 
accgtaagag 
tgcgtaccac 
cctggttgta 



ggcacttgga gtgcttatcg 
cctgggcgca ggcgtaggtg 
cccagtaccg gtaatcaact 
ttactacctg tggaatgacg 
ttacttcaaa gcgaaagaca 
caccatgatg gctggtctgt 
cctggctggc gataccctgg 
tcgttacacc aacggtggcc 



caacgtctgc 
tcgttgagca 
atgaaggcga 
caacggataa 
gtggcgatca 
cttatgctca 
ataacagcaa 
tgaccgtgac 



ggaacagcga aaaccagaac gaatactatt atggcgtatc 
gcggtctgcg tggctataac ccgaatgaca gctggagccc 
gctacaactt cctcggcgac tggagtgttt acggtaccgc 
atgaagttac tgacagcccg atggtggata aatcctggac 
tcacctacaa attctga 
: DNA 
Length : 747 
SequenceName : SEQ ID 721 
SequenceDescription : 



aggcgtagcg 
cccatataaa 
taacttctgg 
actttcaatt 
ccaaatgcgt 
ctttacccag 
cggcattgtc 
fcccgggtatt, 
gcgcaaagag 
ttaccttgag 
gcgctacacc 
tggcctgatt 



ggaatgctgt 
gaatacacca 
aactggacac 
cctctcgggc 



tggtttctgc 
atattggtgt 
ataacgacga 
cgttaatggc 



Sequence 

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 
atgaaaaaaa ttgcgctagc aggtctggcc 
gcaatgagca tcagcggcca ggcgggtaaa 
actgaatcga cgggcctggc tfctaagcggt 
gtcgcgggcg tggggctggg gttgaatctg 
ggaaaaggcg tgtacaccaa cccgaattac 
ggtggtttgc agtggaaaat tggcaacagc 
ccggattcgc tctccagcgg tattcaaagt 
accattatgc gtccagtcag tattgaggcg ggttatcgct acctgaatct 
gacggtaacc gcgacaacgc tgtggctgac ggcctgtatg ttggggttaa 
tga 

<212> Type : DNA 
<211> Length : 543 

SequenceName : SEQ ID 722 

SequenceDescription : 

Sequence 



ggcgatgaag gttatgccgc 
ttccgtttgt ttggcgagta 
tatgaggaag cgaatgctgg 



atcggtcaat 
cggttttggg 
cgacggtgac 
gaccgttggc 
agcggtagga 
ttactactct 
cgcgcgttac 
gtcgggtaaa 
cgccagtttc 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<40 0> PreSequenceString : 

atgactacct tgaccgccag ggtgtttaca acggcagaga 
tgtgtcattt aaactgtagt agacaggaga 
ttgctatcat gatggcttca tttgccgcaa 
catttaaaag tggtaccgga gcaattgata 
caggtacagc atggtacaag ctagatacac 



atcgcattgg 
ataatggcgc 
actcctgtgc 
ttaggtagcg 
actgcgttag 
ggcaatctgt 
aatgacgtac 
ccgatgggca 
ggtgttaacc 
tcaaccgcta 
ttcaataagt 
tcgccctggt 
attaatggcg 
ggtaataatt 
ggttttgcgg 
tcacgagaaa 
agcactgata 
ggtcgggcct 



ctgcattccc tggtggaccg agagagcaag 
atgtgtttgg cggcattggc aaaaacagcg 
acaaatacaa ccccaaaacc aatagctggg 
tggcgggtca tgtgactttt gtacacaacg 
agaatatctt caatggctat tttgaagatc 
tagataaaat caacgcccac tattttgaca 
ttctgttgtc ttttgatccc tcaacacagc 
acggaacggc tggtgcggcg gttgtgaata 
aagccaaacc aggattgcga acggatgccg 
taaaatggaa taagcttgat cccgtctcat 
ggataagcaa tgattctctt atatttgctg 
attaccagaa cggtaagaac tatgcgcatg 
ttcatctttg gcataacggg aaatgggata 
acggagtatc attgccctgg aataatagtt 



actgcaggcg gcaaagcggt gacggattca gttttaattt 



taatttatcg 
cagtcacaat 
acgcgtctgt 
acgacactgt 
aggccaaaga 
caacctctgc 
agggattgac 
ttaaattgat 
gcaaggctta 
tcaacgaggc 
aaaaagcaga 
aatggagtta 
aaggtgataa 
tatttgaact 
caccagatgg 
gaggggccgg 
aaggcctgaa 
aatcgggtga 
tattaattat 
ctgtgaagga 



gaaaacagtt 
gaataaaact 
attaccggaa 
ctacattggt 
taaaaaatgg 
atttattgat 
tcaggtattt 
gtcgcacgcg 
tgttactggc 
tggaaaagat 
agattatttc 
cgctggcgaa 
aacctggctt 
tgatttcacc 
cgtcgctggc 
attcaaaggt 
aaaatcatat 
attatcgcaa 
tggcggtgaa 
taataaagtc 



60 
120 
180 
240 
300 
360 
420 
48Q 
540 
600 
660 
720 
747 



60 
120 
180 
240 
300 
360 
420 
480 
540 
543 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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acagtacaaa actaa 1215 
<212> Type : DNA 
<211> Length : 1215 

SequenceName : SEQ ID 723 
5 SequenceDe script ion : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 

10 <4 00> PreSequenceString : 

atggcaactg gcggcgctgc cctggcaggt aaagcggtta tgggtgccgc agccggtgca 60 
gccggaggtg caagtgcact ccaagcggct tttcagaaag catcagcgag tatggaaacc 120 
ggcggtgaca tgtccagcat ggggtcagtt gtcagcagtg gcggaaacgg tggtggtgaa 180 
gcgggtactg ctggcagtag cccattcgcc caagcggctg gctttggtga cagcggcagt; -24 0 

15 agctcaagcg gtggcggctt tgccaaggcc gcgaagctgg ccacaggcac ggcctccgag 3 00 

ttagccaagg gtgtcggctc tcaagtgaag cagggattcc aggagcgagt gagcgaaacc 360 
acaggcggaa aactggctgc ttcgatacgc gaaagcatgg agccgaaaga agcaagccaa 420 
tctggccagt tcgagggcaa tagcttgggc gccgattctg gcccagatag taacgaagtc 480 
aggagttag 489 

20 <212> Type : DNA 

<211> Length : 489 

SequenceName : SEQ ID 724 
SequenceDescription : 

25 Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

atgaagcgag ttcttattcc tggcgtcatt ttatgtggcg ctgatgtggc gcaggccgtc 60 

30 gatgacaaaa acatgtacat gtattttttt gaagagatga cggtctatgc tcctgtccct 120 

gtacccgtaa acggcaacac gcattacacc agtgaaagca tcgagcgttt accgaccggg 180 

aatggcaata tcagcgatct gctgagaacc aaccctgcgg tacgcatgga ttcaacgcaa 240 

agtacctcgt tgaaccaggg agatattcgc cctgagaaaa tctctattca cggtgcgtcg 300 

ccctaccaga atgcctattt gattgacggt attagtgcaa ctaataacct gaacccagcg 360 

35 aatgagtccg atgccagtag tgcaaccaat attagcggga tgtcacaggg gtattatctt 420 

gatgtcagct tactggacaa tgtgacgctt tatgacagtt ttgtgccggt tgaatttggt 480 

cgcttcaatg gcggggtaat tgatgcaaag atcaaacgct tcaacgctga tgatagcaag 540 

gtgaaattgg gttatcgcac tacgcgtttg gactggttaa catcgcatat cgatgagaat 60 0 

aacaagagcg catttaatca aggttcttca ggaagtacct atttctctcc agattttaaa 660 

40 aagaactttt ataccttgtc gtttaatcag gagctggctg ataactttgg cgttaccgcc 720 

ggtttatcgc gccgccagtc tgatatcacc cgcgcggatt atgtttcgaa tgacggcatt 780 

gtcgccggtc gggcacagta taaaaacgtt atcgatactg cattgagcaa atttacctgg 840 

tttgccagcg accgctttac ccacgattta accttaaaat ataccggctc cagccgtgat 900 

tataatacca gcaccttccc gcagtctgat cgcgaaatgg gtaataaatc ctatggtctg 960 

45 gcatgggata tggatacgca actcgcatgg gccaaactac gtaccaccgt tggttgggat 102 0 

catattagtg attatacccg tcacgatcat gacatctggt acaccgaact ttcatgtaca 1080 

tatggtgata ttacagggcg ttgtacccgt ggcggattag gacacatttc ccaggctgta 1140 

gataattaca ccttcaaaac acgcctggac tggcaaaaat tcgccgtggg tgatgtttcg 12 00 

catcaaccct acttcggcgc ggaatacatc tattccgatg catggactga acgccataac 1260 

50 cagtctgaat cctatgtgat taatgctgca ggaaagaaaa ctaaccatac catttaccat 1320 

aaaggtaaag gcagcctggg aattgacaac tacacgctgt atatggcgga tcacattagc 13 80 

tggcggaatg tgtcgttaat gcccggtgtg cgttatgact atgacaacta tctgtcaaac 1440 

cacaatatct ccccgcgctt tatgacggaa tgggatattt ttgctgatca aacctcaatg 1500 

attaccgccg gttataaccg ttactatggc gggaatattc ttgatatggg attacgtgat 1560 

55 atccgcaata gctggacgga atcggtatca ggtaataaaa ccctgacgcg ttatcagaat 1620 

ttgaaaacgc cttataacga tgaactggca atgggattgc agcaaaaaat cgataagaac 1680 

gttattgcac gcgcaagtga agcgcatgat caaatcagca aaagcagtcg taccgacagc 1740 

gcgactaaaa ccaccattac tgaatataac aacgatggca aaaccaaaac gcattcgttt 18 00 

aacctcagtt ttgaactggc cgaacccctg catatccgcc aggtagatat taacccgcaa 18 60 

60 attgtcttta gctatatcaa gagcaagggc aacttgtcgt taaacaatgg ttatgaggag 1920 

agcaataccg gtgataacca ggtggtttat aacggtaatc tggtctctta cgatagcgtt 1980 

ccagtggcag attttaataa cccattaaag atctccttaa acatggattt cacgcatcaa 2040 

ccgagcgggt tagtgtgggc gaatacgctg gcctggcaag aagcgcgtaa agctcgcatt 2100 

atccttggta agacaaatgc gcaatacatc agcgaatatt cagattacaa gcagtatgtt 2160 

65 gacgaaaaac tggatagcag cctgacctgg gacacccgct tgtcctggac gccacaattt 2220 

ctgaaacaac aaaacctgac gatcagtgcc gatattctca atgtactgga tagcaaaacc 22 80 

gctgttgata caacgaacac cggtgtggcg acctacgcca gtggccgtac tttctggctt 2340 
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gatgtcagca tgaaatttta a 

<212> Type : DNA 

<211> Length : 2361 

SeguenceNarae : SEQ ID 725 
SequenceDescription : 



2361 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Sequence 

<213> OrganismName : Shigella flexneri 2a str. 2457T 
<4 00> PreSequenceString : 

atgaagaaaa cattgttagc aatcatgctg gcggggacag cttttgcttc 
actttagtaa gccagggtac tgaagcgtca gctaatctta ctttgaccaa 
gttaataata ccattcaacc agtgaaaggg gtatatagcg gtacgttaac 
cctttggcaa caggtattgt tggcgcatct gatgggcaga gccacgacta 
ttcccggatg atatttatgc ggagagcagt acttcagcgg atgcagtaat 
aataacccgg accataaact gaaagtttcc ctgacaacgc ttgagcagga 
gctgcctccg aagagattgg cggtaagcgt tatatgatgc tgaaaaatac 
ggtgcctatc gcgtcgtttc ccatatgaaa gaacaggttg ttgagccaga 
atccgtaccc aggcttatat ctacgcagaa taa 
<212> Type : DNA 
<211> Length : 513 

SequenceName : SEQ ID 726 

SequenceDescription : 

Sequence 



tcaggcggga 
gcctatcgtt 
tgcatggacg 
tgctgtaact 
ttctggtgac 
tcctccttca 
tggaacaggt 
ctcttacacc 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgggtattt accactggtc ccggaagaca aaaatgaaac ggacaaaatc 
gcatcgttcc gcaaaaactg gagcgcacgc catctgacac cagtcgctct 
actgttttta tgctggctgg ctgtgaaaag agtgatgaaa cagtgtctct 
gctgacgact gttcagctgc aaacccaggc aaaagcgccg aatgtaccac 
.aatgcgctga aagaagccga acgtactgcg ccgaaatacg ccacccgtga 
gctgaatttg gtgaaggtca gtgtcagcaa gcaccagccc aggctggcat 
aaccaggcgc aggctcagca atccagcggg agtttctgga tgccgctgat 
atgatggggc gtctgatggg cggcggcgcg ggatttgcac agcagccgct 
aaaaacccgg ccagtccggc ttacggtaaa tataccgacg cgacgggtaa 
gcagcccagc caggccgcac catgaccgta ccgaagacgg caatggcacc 
accaccacta ccgttacccg tggcggtttt ggtgaatctg ttgccaaaca 
cagcgtagcg caaccggcac ctcttctcgt tcaatgggtg gctga 
<212> Type : DNA 
<211> Length : 705 

SequenceName : SEQ ID 727 

SequenceDescription : 

Sequence 



catacgccac 
cgcggttgcc 
ctatcaaaat 
cgcgtacaac 
agactgtgtt 
ggcaccagaa 
ggccggttac 
gttctcctcg 
aaactatggc 
aaaaccggcg 
aagcactatg 



<213> OrganismName : Shigella flexneri 2a s 
<4 00> PreSequenceString : 

atgacaaaaa tgagtcgcta cgccttgatt accgcgctgg 
g fc 99ggcaac gtgaacctgc accggtagaa gaagtgaaac 
gagccacaac agcctgtccc cacagtgccc tcggtgccga 
ccaattgagc acgaagatcg aacggcaccg cctgcgccgc 
aatggcgcaa tgcagccgat ggtcagtaag atgcttgggg 
agcgtcctgc tggttgatag cgttaacaac cgtactaacg 
gcgaccgaaa cgctgcgaaa tgcgctggct aataacggga 
cagcaactgt cgatggctaa gcaacagtta ggtttgtcgc 
cgtagtaaag ccataggcat tgcccgcaat gtcggcgctc 
gcctctggca acgttaacgc tccgacccta caaatgcagc 
gaaattatct ggtcaggtaa aggtgccgtt tcgcagcaat 
<212> Type : DNA 
<211> Length : 642 

SequenceName : SEQ ID 728 

SequenceDescription : 

Sequence 



tr. 2457T 

cgatgtttct 
cagcgccgga 
cgatcccgca 
atattcgcca 
ctgacggggt 
gttcgctgaa 
aatttaccct 
cgcaggacag 
attacgtgct 
tgatgctggt 
aa 



cgccgggtgt 
acaaccagcc 
gcagccaggc 
ttatgactgg 
gactgcgggt 
tgccgcagaa 
ggtttccgcc 
tttaggcacc 
gtactcctgc 
gcagacgggc 



60 
120 
180 
.240 
300 
360 
420 
480 
513 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
705 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
642 
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<213> OrganismName : Shigella flexneri 2a str. 2457T 
<400> PreSequenceString : 

atgacgaaac ttatgcaatt tgttcagagg tgctattata tgactaataa gaaaatgtat 60 

tttatattaa tacttgtttt cacattgcta caggtgtgtt tttttgctct ctggaaggct 12 0 

5 cgtgatggta gcacaacgtc tcttgaatgt acgtcgacgt taacaagaaa tgctaaaaca 180 

gatcattctt tatactactc tgctaacctt tcagtcattt taaaaaaaga tggtagcggt 240 

agttttacga ttgttggctt aactgatgaa gatacaccga ggaaattttc ccattcgtat 300 

ttttttacct acaaaataga tagtaacgga cggatctcag gtaatgctaa ggccaaagtt 360 

tcgggactgg aaaatcagat aaaggatgag aacttcagac tcaattttct cgatgcatct 42 0 

10 ttaacaggta aaggcaatgc gaggctaagc aagtttaata atgtctatat ttttagtatc 480 

ccggggttga tcattaacac atgtgctcca atataa 516 
<212> Type : DNA 
<211> Length : 516 

SequenceName : SEQ ID 729 

15 SequenceDescription : 

Sequence 



<213> OrganismName : Shigella flexneri 2a str. 2457T 
<40 0> PreSequenceString : 

atgggcagga taagctcggg aggaatgatg tttaaggcaa taacgacagt cgccgcactg 60 

gtcatcgcca ccagtgcaat ggcgcaggat gatttaacca ttagcagcct tgcaaagggc 12 0 

gaaaccacca aagctgcgtt taatcagatg gtgcaagggc ataagctacc tgcatgggtg 180 

atgaaaggcg gtacttatac tcccgcacaa accgtaacgt tgggagatga gacgtatcag 240 

gtgatgagcg cgtgcaaacc gcatgactgt ggctcgcaac gtatcgctgt gatgtggtcc 300 

gagaaatcta atcagatgac ggggctgttc tcggctattg atgagaaaac gtcgcaagag 3 60 

aaactcacct ggctgaatgt gaacgatgcg ctttcgattg atggtaaaac ggtgctgttc 420 

gcggcgttga ccggcagcct ggaaaaccat ccggatggct ttaattttaa ataa 474 

30 <212> Type : DMA 

<211> Length : 474 

SequenceName : SEQ ID 730 
SequenceDescription : 

35 Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgaaaaaac aatttttgga aaaagctgtg tttactgttg cggctacggc agcaacagtt 60 
gttttaggaa ataaaatggc tgatgcagac acttatactc ttcaagaagg agattctttt 120 
ttcagtgttg ctcaacgata tcatatggat gcttatgagt tagcttctat gaatggaaaa 180 
gatattacca gtctgatttt gccgggtcag actttaactg ttaatggttc ggcagcaccg 240 
gataatcagg cggcagcgcc aactgacact acgcaagcaa ccactgaaac gaatgatgcg 300 
aatgccaata cttatcctgt tggtcaatgt acttgggggg ttaaagctgt tgcaacttgg 360 
gcaggcgact ggtggggcaa tggcggtgat tgggcctcta gtgcttctgc acaaggttac 420 
actgtcggta acactccggc agtagggtct attatgtgtt ggacagatgg tggttatgga 480 
catgttgcct atgtcacagc tgttggtgaa gatggtaaag ttcaagtact ggaatccaat 540 
tataaagatc aacaatgggt tgataactat cgtggttggt ttgatccaaa taatagtgga 60 0 

acaccaggca gtgtcagtta tatttatcct aactaa 63 6 

<212> Type : DNA 
<211> Length : 636 

SequenceName : SEQ ID 731 
SequenceDescription : 

55 Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgtctatta aaaatatttt agaaaacaaa acaacaacaa ttaaagttag ttttgcagga 60 

60 attgcaacag cagctagttt aattttacct atggcagtac aggcagaaac tacttatact 120 

gtgaaatcag gagatacttt atcagagatt gcttcaacac acggaacaac tgttgataaa 180 

cttgctaagt taaataaaat taataatatc catcttatcc atgctggtca aattttagaa 240 

ttagatgcag caacagaaga tactgatgca acgccagtac aagaaagtca gataaatgaa 3 00 

gcagaaacct cagcatctgc caaaactagt caaacgagcg aagtgacgac aacagcaccg 3 60 

65 gtacaagaaa gccaaacaag tgaagttata acttcagcac cagctgaaac cagtcagaca 42 0 

agcgaagtgc caactgaagc caaccaaaca aatgaagtaa gttcagcagt atcggttgaa 480 

accagtcaaa cgagtgaggc gacgacttca gctccagtgg aaactagtca gacaagcgaa 540 



20 
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gcaacgacag cggaaccaac tgagaccaag 
gctgaagaaa accaaacaac atctaatact 
aaagaattca tcgctcaaaa agaatcaggt 
tatggacgtt atcaattgag tgattcttac 
gaacgtgtag cagatgctta tgtatcaagt 
ttctggaatg ctaatggttg gtattaa 
<212> Type : DNA 
<211> Length : 867 

SequenceName : SEQ ID 732 
SequenceDe script ion : 

Sequence 



accagccaaa 
agcggtttga 
ggtaattata 
ttgaatggtg 
cgttatggtt 



caaatgaagt 
gcacatctga 
atgctaaaaa 
atttgtcaga 
catggactgc 



agcagcttca 
tgcagctgca 
tggtcaatat 
agaaaatcaa 
tgctcaagct 



<213> QrganismName : Streptococcus, mutans UA159 
<40 0> PreSequenceString : 

atgaaatgtc aagcgtttga agattttaaa gctacaagtt tgaataagct 
acaggtggag ctactgatgg tgaaattata gctaatcgga tgttacaagg 
aagggcgaaa ttactatgta tacttggaac attattcaaa atggctgggt 
gtgtcttggg gtattggtgg ttataatagt tctataggat actcagctca 
ggatttagta attatccata cgatgtttct atggattcag ataatagcag 
aatacgacag gtggttatgt taattataac cagagtttta attctggatg 

<212> Type : DNA 
<211> Length : 3 54 

SequenceName : SEQ ID 733 

SequenceDescription : 

Sequence 

<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgcgctatt cacaaatttg tcgtaaaagt ttggctttgc tggctacagg 
actacctcaa ccttacctag tataagtatt ctagctgagg acagtactgg. 
aggccagatg gacaagctcc tgctggaggt ggtgctaaca ccacaactta 
ggaatcaaca gtggtgttct agttgctaat ggtagcaagg tcacttccag 
aaatcaacta cttccgccca aaatacagct cttgttcaaa acggtggtag 
cataaagcga atttaataaa atccggtgat gataacaatg gtgacaatga 
ggtattaatt ctattttact agcagtcaat gaaaggtcaa aagcttatgt 
aaactaaaag ctagtagctc tggcagtaat gggatttttg caactgataa 
tacgctaata aaacaagcat tgcgactaca gctgataatt cacggggact 
tacaatggca atattattgc taataagatg gccatttcta caaaaggtgc 
gctattgcaa ctgaccgtgg tggcggcaat atttccacca ctaattccag 
agtggctctg gctcacctct tctttattca acaggcaata ttcaagttaa 
ggaacatcta gtaacagcca aattgctggt atggaaggtc ttaataccat 
aattctaatt taattagtac catgacaaac aaaactgcca gtgacccgat 
gttatcatct atcagtcaca atccggtgat gccgaagcaa caacggggca 
ttcgagctca gcaagtctaa attaacttct tccattactt caggttctat 
acgaatacct ctgcaaacat tatccttaat caatccacct tgaattttga 
gctaaacttt tgactgtagc aggcaatagt gccaataatt ggggaacccc 
ggggcaacag ttaactttac tggccataag cagacactta aaggggatgt 
agtatttcga ccttaaatat gtacctgctt gataaaacca actacactgg 
gtatcaacca acagtaccaa tatatcccca agcacgtctc ctattaccat 
aaaaattcca aatgggtgct aactggtcat tcgacagtaa ccaatctcaa 
ggtgctaaaa ttgttgataa agacggaaaa accgtcagcg tcatctcttc 
aaacttgtta aaggtaaaag caaatatagc ctaacagtca caggaactta 
gtaacaacca gctcaagtaa caaacccagc agcagttaca ttaaccgtag 
aattatttta aaacaacaac agcctttgta aataatacca aaaatacaag 

<212> Type : DNA 
<211> Length : 1557 

SequenceName : SEQ ID 734 

SequenceDescription : 



gtcttatacg 
taaagctact 
gaattcactc 
aggtaataga 
tagttcaagt 
gtaa 



aatgatttta 
agcacctgct 
tgattacagc 
ttccaaaacc 
tttaacactg 
taatttttac 
ttcgaattcc 
ggcaaccatc 
tgatgccact 
tcacagcgct 
tttaaatact 
tcacgttaca 
tcttattcat 
tgccaatggc 
aagtgcccac 
gttctacctg 
tgcaaataag 
cggtagtaat 
tgatgtggat 
caaaactgct 
gaatatttct 
tgctgaaaaa 
aagcggacaa 
cagtcaaaag 
tgacttcgat 
taattaa 



600 
660 
720 
780 
840 
867 



60 
120 
180 
240 
300 
3 54 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1557 



65 



Sequence 

<213> Organi smName : Streptococcus mutans UA159 
<400> PreSequenceString : 
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atgaacaaga taggtgatac tctacgtgat gctcgtattg aaaaaaaatt gagttttgat go 

gatgttgtag ataaaactgg aattgctcct cactatatac tggcaatgga gttagatcaa 120 

cttaaattat taccagaagg caaaacaaat gagtatttag aaaaatatgc tcatgctgtt 18 0 

ggcttagatc cggtttctat tattcatggt tatcgcaatc aggaaatgag cgatgaactt 240 

atcctacctt cttctgcaga attggctgct tcttcagata gcaatataga aaagaaaaat 3 00 

gaaggaaaat caattgaaga acctcaagag ttagctattg atagtttgga tgttactcaa 3 60 

aatattacag aagagactcc ccaaatagaa gattttaagg ttgaatcaga ggaggcaagt 420 

aaaaaaatag aaaaaatacc atctcgactt agtaaatatg attatgatga agaaccgaaa 480 

aagaaatttc catgggcttt aattctgcta attttacttg ctttaactat tatcagttat 540 

gttggatatg tggtttataa tcagctgcaa actgattcaa ataagactga attaagtacc 600 

tctacgaaaa aatctaagga tactaaaaat gatgccaatt caacgacaca aagtcagacc 660 

agcataacaa cagactttgc tgatggtgga aataatatca ccttaagtaa tactaatgga 720 

aaagtagagg ttacctttac tttgacaggg gatgaggaaa gttgggtttc agcaactaac 780 
actactgatg gagaatctgg aacaactcta. acagcaacag ataaaactta tactgttact, . ,^,840 

ttagcagaag gttcaacaac atctatgctg acagtcggat cacctagcgg tgttgaaatt 900 

acgatcaatg gtcaaaaggt ggatactaca aaccttgtta atgctggttt gactaatatt 960 

aatttgacag tccaataa 978 
<212> Type : DNA 
<211> Length : 978 

SequenceName : SEQ ID 735 

SequenceDescription : 



Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<4 00> PreSequenceString : 

atgaagagca gaaaacgtca gcgaaaaggg ctggttcgaa aaaatgaaat cattatttta 60 

actctttttg tcgcatcagc cgtttcactc ttagcattta ccaattcttt tggggtactt 120 

gctaagagtc ttcatcttga aaaaatcaat aaatcgataa ctatttcgct tccttttggt 180 

aaaaaaaaga tggaacaaac agcacgttac tattcaggtg aacaggttca gatcagttcc 240 

tcagctaaaa aagacagtct tggaaaaggg ctgtctcatt accaaaattg gattgggaca 300 

gttaaaaaga taaaatcaca aaaagacagt cgtcaaaagc atcattatag ttatgaagtt 360 

acatttgata atggcaaagc tttgaaatat gttcaggaaa aagacttagt, taaaacaaaa 420 

agatctaaat acagcaaagg tcaaattgtc aaattaaaat cctctgcgac agctgattta 480 

gatggcagca gtttaacgga ttatcgtgca tcagctggta aaatcgatca tatttcttat 540 

aatcatagca ataccacagg tggctataag tatgatatca cctttgatga aggcggcaag • 600 

gtgactaata ttcaagaaaa agatttggat aaggtttatg aagttcagct aaaatcagaa 660 

aatactgcag ctcaaaataa tgagattctt aaacaagctt ttgcatatgc taaacaacat 720 

tctggtacta tcttaagttt gccaaacggt gaatttaaga ttggcagtca gacaccagac 780 

aaggattata tcactttaac atctgatact gaaattcgtg gagataatac gacacttttg 840 

gttgaaggat cggcttattg gtttgctttt gcaactggga catcagctag tgatggtgta 900 

aagaatttca ccatgcgcaa tatcaatata aaagccagtg atttggaaaa aggaaatcag 960 

tttatgatta tggctgatca tggtgataat tggaaaattt gtaacaatag ttttaccatg 1020 

gtgcataaaa aaggaagcca catttttgac ttaggttcct tgcaaaattc agcttttgaa 10 80 

ggaaatcaat ttactggcta tgcccctgaa ttaactaatg tcagtaagat tgatgataat 1140 

gcagatcttc atgattttta ttcagaagtt atccagctgg atgccgctga gtctagcggt 12 00 

gtttgggatg gtgggttgat aaaggctatt gatccaaatt atgaaaatta taataaagag 12 60 

aaacagttgt gcaataacat cacgattgct aataactctt ttgttcctta tatagacagt 1320 

catggtaaaa taatagcata cagcggaact atcggacagc actcttctga tgtcggcctt 13 80 

gtcaaaattt atgataatgt ttttagtaac tctttggtca gccgtttcaa tcaaaatggc 1440 

aaaagcgagg cgtggatttt taaagctatt cacctaaaat caaattataa taatgctgtt 150 0 

tatgccaatt ctatcagtta a 1521 
<212> Type : DNA 
<211> Length : 1521 

SequenceName : SEQ ID 736 

SequenceDescription : 



Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<4 00> PreSequenceString : 

atgagaaaac ttaaagtggc actttttgca agcagtattt taggaatgct ggctgttagt 60 

tcttatacgg cagcagatac agaggataat caggtaacga ttagccatta taatgaacag 120 

gctggaactt ttgatgtcaa cgctgtacag gcagccaacg gaaaaactat tcaatcgata 180 

gatgttgcga tttggtctga agaaaatggg caggatgatt tgaaatggta tcatgccagt 240 

aatgatggca gcaatcaatt gacagttcat tttaatgctg agaatcatgg cagtaaggta 3 00 

ggttcttata ttgcgcatgc ttatattacc tatacagatg gtaatcgagt cggggttaat 3 60 
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ttgggaaaac gaaaattatc cttatctgca 
caactatttt ctaagctgaa acctagtgca 
gatgagaatg gtcaagatga tcttcattgg 
gctggctatg ctaatcataa aggttatgga 
5 aatggtaaga tgataccaat tagtgctcaa 
attcagattg ataaaataaa tgataccagt 
tatattagtt cagtagccat tcctgtgtgg 
tggtatcagg caacaaaagt ggctgatggt 
catcgttttg aattaggcaa ctatcaagct 

10 aaactggatg gtttaggaga aactcatttt 
cctcaggtaa ctattgatca ttataatatt 
gaaacagata attcaaaagc gatacaatca 
caagctaatc tttattggta tgaagctaaa 
gtcgatgttc aaaagcatgg caatcaaaca 

15 tataatgatg gcacgactag cggacatgtt 
cattatcaac cttctgcagt aagaataaca 
gttggtcagt gtacttgggg agtgaaagaa 
aatggcgggc agtgggcaag tactgtagct 
aaagttggtg ctattgcttg ttggagtgat 

20 cacgttgaga gtaataaccg tattcaagtg 
atttccaatt ttcgcggatg gtttgatccc 
atttatcctg actaa 
<212> Type : DNA 
<211> Length : 1635 

25 SequenceName : SEQ ID 737 

SequenceDescription : 



ccgcaattat 


ccttaaaaca 


aggtggcctt 


420 


gcggatcaac 


ttttttcagc 


agt t tggt eg 


480 


tacacggcag 


atgctgacgg 


gaatactttg 


540 


acttaccatg 


ttcatactta 


ccttaagcaa 


600 


gat at t gat a 


ttcctaaacc 


gaaagtcaag 


660 


tatgatgttg 


ttgttaataa 


tgtcccccct 


72 0 


agtgaacaaa 


atggccaaga 


cgatttgaaa 


780 


atatttaaaa 


caactgttta 


tttaaagaca 


840 


catatttatg 


gcga t ag c c a 


attaagcaag 


900 


aatgfctccgt 


ctattattaa 


ctatgaagat 


960 


aacaaaggaa 


cgtttgatgt 


gactgtagct 


1020 


atcagtgctg 


ctgtttggtc 


tgatgetaac 


1080 


cagctagcaa 


atggaaaggc 


tgcaattact 


1140 


ggaagctaca 


atgtccatgt 


ttafcgttcat 


*. . 12 00 


ttggctaatc 


agcagctcaa 


tcaaattgtc 


1260 


gcctatatga 


atgaaaaaaa 


tacttatcca 


1320 


ttagctcctt 


ggatacctaa 


ttggcttggc 


1380 


gttaagggat 


tcaaaatagg 


aactgttcct 


1440 


ggtggttatg 


gccatgttgc 


ttacgttacc 


1500 


aaagaagcta 


attataagaa 


tcaacaatat 


1560 


acgacttcct 


atttgggaag 


attaacttat 


1620 








1635 



Sequence 



30 <213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString t 

atggcaaata actattctcg tegtcaacaa cccactaaaa aaacaaaggg 
aaaegtcega ctgaacatat caaaacaggt ttttcagcac tgcaaaagag 
atcgctggta ttttagggat tattaccget ttgattacta ttaataatta 

35 tcacacaatg ataaaaaaga ttccacatct aaaaccacta tcatcaaaga 
gatgactcaa atagtaacaa caatgetget aattctcaag ctgaaaatga 
aataataatt ctgeagaate aaatcaaaac caaactgeaa caacagcaaa 
ageaattegg ctaatcaaaa teaagecaat agccaatcac aagcaaataa 
caaaacaatg etaatgetgg tcaataa 

40 <212> Type : DNA 

<211> Length : 447 

SequenceName : SEQ ID 738 
SequenceDescription : 



gaeaagtegg 
tgttgctatt 
tegcaatagt 
aaaagaagtg 
cagcaataac 
tgacagtaac 
tcagcaaaat 



60 
120 
180 
240 
300 
360 
420 
447 



45 Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> P resequences t ring : 

ttgaaaatat ttagttttgg aaccattcga aacaacacag ctctaaaacc taactatgat 60 
50 gacacaacag ettttagegg ttttggaacc attcgaaaca acacagctct aaaacagagc 12 0 

actaactgeg ctagctggtt caatcgtttt ggaaccattc gaaacaacac agctctaaaa 180 
ctcaccatat taattaatgg cgtttccttt tgttttggaa ccattcgaaa caacacagct 240 
ctaaaacctc gtggaccaat ttttgtctcg acatttegta atcgcgccat tcatctcagc 3 00 

cagatttcag cttcaaaatg a 321 
55 <212> Type : DNA 

<211> Length : 321 

SequenceName : SEQ ID 739 
SequenceDescription : 

60 Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 

atgaaaagaa aacgaaatct ttactttctt attggtttat ttttgacagt ctttcttttg 60 

65 ataggatget caatgeagaa aaaaaccaaa tcagaaagca gctcgacttc tcaaaagact 120 

actttacaaa caaaacagtc aagtgaaaaa tcaactgatg ctaagcaaac gacagaagct 180 

cattcagaaa gcagtcagtc ttcttctcat tctaataacg aggaaaccct tgctcccatt 240 
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gatacaggcg ctgttttaaa ggctgattac agtagtatgg caggaacttg gaaaaatgaa 3 00 

gaaggacaaa cgttgacatt tgatcagcga ggtctgacaa cccctggaat gacagtcagt 3 60 

ctgttgaaca ttgatcaaga cggaaatctt ttgttaaatg ttgagactgg aacaaaaaag 420 
aatctaactc tttatattgt gccagccaat aaaaccttat ctaatcaata tttttctaat 480 
5 ggtcaaagcg atgaatccga taaaacaaaa gatcgtattg tttcttctga gagtttaaat 540 
agtggcaaat ttacaaaccg agtttattat catgtttcaa ctcattaa 58 8 

<212> Type : DMA 
<211> Length : 588 

SequenceName : SEQ ID 740 
10 SequenceDescription : 



Sequence 



<213 n. Organi smName ? Streptococcus . mutans UA159 

15 <40 0> PreSequenceString : 

atgacaccta aaaaaatcaa aatagctcta acagctctta tctctttgat gctcgcctta 60 
tttttattct tgtttaatca ccattcagta agagaaaaca gtcagcaaga aaagttaaag 120 
ataagtaaag caagttctaa aaaatcacaa acaagcactt cttctgttat gacaagtagt 180 
cgaaaagcta ctgaacaaac aagccaagca cagactcaaa gtcaatcaca agcagaacaa 240 

20 agtaacccta atgtgatcct ccccattccg caagaattag tcggcaccta caaaggttcg 3 00 

agtccacaag catctgaaat aacttttacc atttcttcaa atggtcaatt acgtgctcaa 3 60 

gctaattttg atcctgcttc tgatataaat gacgttaccg ccactgttag tggtgttaga 420 
aaagtcgggg cagataccta tatttgggag tttgtctctg gtagttcagc tgctctttta 48 0 

ccgggtgtta caggtatagg agggcttgga aagatgcagc ctggtttcat cctaaaaggg 540 

25 gggcaattaa cacctatcat gtttacaggt tctgtagatg gtgaaattga ttattcacat 600 
cccaatccct atccagtatc attaaacaag cagtaa 636 
<212> Type : DNA 
<211> Length : 63 6 

SequenceName : SEQ ID 741 

30 SequenceDescription : 



Sequence 



<213> Organi smName t Streptococcus mutans UA159 

35 <400> PreSequenceString : 

atgaaaaaaa taattaatgt tatcgtctta tcactatccg tttttttcct gatagcttgc 60 
agcaatagca gtaccgggga aaaaacaagt caatcatctg aagagactaa ggtccgatta 120 
attgttaaaa cggattctaa taaaaccgat gaaaaagtcg ctttcaaaaa aggtgctact 180 
gttatggatg tcttaaaaga caactataaa gttaaagaaa gcggtggttt tatcactact 240 

40 attgatggtg tcactcagga taaaaaagca ggtaggtact ggatgtttga tgttaatgat 3 00 

aagctggcat caaaagctgc tgataagatt aaggtcaaaa atggggataa aattgaattt 360 
tatttgaaag tttataaagg taagaactag 390 
<212> Type : DNA 
<211> Length : 390 

45 SequenceName : SEQ ID 742 

SequenceDescription : 



Sequence 



50 <213> OrganismName : Streptococci 
<400> PreSequenceString : 
atgtcaaata aaccatggga agaaaaagta 
acaagaaatt caaaggatgc tagtattatc 
ctctttttct tgattattat tggtatttta 

55 agcaatgaaa aagcagccac ttcgggtttc 
aaaaatgagg caaatagtca aactgatgaa 
gaaacgacaa gttcctcttc agacagtgat 
ggggctgcag caattgctgc gcgtgcgggt 
ccagaacata tgacgcatgg ctattggtat 

60 taa 

<212> Type : DNA 
<211> Length : 483 

SequenceName : SEQ ID 743 
SequenceDe script ion : 

65 



is mutans UA159 

actgatgcaa ctactgataa tgaagaaatg 60 

agtacaccta ttttaacaat cctattgagc 120 

ttttttgtac tttatacttc aaatggtgga 180 

tatagttctt ccaaaacggt caaaaaagcc 240 

cagacaacag aagcagaaac aagttcaagt 3 00 

ggcgagacaa ttacagttca aggaggtgaa 360 

atttctgttg ataaactcta tgagctgaat 420 

gctaaccctg gagataacat caagattaag 48 0 

483 



Sequence 
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<213> OrganismName : Streptococcus mutans UA159 
<400> PreSequenceString : 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



atgccagata atcgcatgaa 
attactttgg aaacaggaga 
cccagtgtca ctctcaatac 
ggagcaattg gtcgttctgt 
aatgctagcg atggtaaatt 



gaattgggag 
gctcaatatc 
ctttttgtta 
aaaaaaatag 
agtagggatt 
ggtgaaggcg 
ctgcagoagt 



aaaaacaata 
aaatgaaagc 
tgacaacaga 
aattacagaa 
tgaactatga 
ttgtcaatac 
t.tgctggtgt 



ctatagtatt gatagcaata tgcagtttcc 
atttgcttat attcaacgcg gtagcatggt 
caaagtcaat ggacgtggtt caggacttgg 
aacgtctgga gaaagttttt tcattactca 
ggccttggcc ccttctatgc cgggccaagt 
tcgcctcaat gatggtgctt ttcttgcctt 
tcagagtgtt ggacgtgccc tttttggcgg 
aggtcaaggt accttgcttg ctaatagttt 
tcaggaaatt acaattgaca atgctcatgt 
cattcatttg gaaaatggct ttatgcaatc 
tttccgagga acgggtgaaa tttatgtaca 
cctacagggt ttcattacca atactaatcg 



cttggtagaa 
ctatcacaca 
caagctagta 
ggcagtatca 
tattgcttta 
agatggttct 
tcaaggcggt 
tggttctatc 
tgtagcttgg 
gatcggaacc 
aagtcttaat 
ttaa 



<212> Type : DNA 

<211> Length : 714 

SequenceName : SEQ ID 744 
SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus mutans UA159 
<4 00> PreSequenceString : 

atgaaaaaaa attatttttg gtacggtctg cttgggcttc tcgcacttta 

attgctttta tcccgggatt tcatattttc tttagcaaca tgttgatgtt 

tttatgttaa tagctttgag taacaggagt atcttctttt tctttctagc 

cttagtatct acttgaaaga tatctttcat tttgactatt ctaccggacc 

ggtataatca ttatcggcgt tattttaaac agtttcctca aaccacacta 

tataaaggaa atcattattt taatatgaaa caacatgcta actacattga 

gatgtctttt taaaaacact tttttctgaa aataccagtt atgtgacttc 

aataaaatta ttattgatac taagtttgga gaacaatctg ttgatctctc 

tttatgacag attctcccga aattcatata gatgttagct ttggtgaaac, 

attccaaaca actggaaaat catcaataaa actcactccc cctttgcttc 

tcaggttttc ctagcacaaa tggtgatttt attaacgtta cattgactgg 
atgggatctc taaacattca atattaa 
<212> Type z DNA 
<211> Length : 687 

SequenceName : SEQ XD 745 

SequenceDescription : 

Sequence 



ccttattact 
ggccctgttc 
cttaggtttt 
gctttttacg 
ttcttattct 
taacgaaaca 
tcaagaatta 
tcaagctcaa 
caatctgcgt 
catttcattt 
aacagtggct 



<213> OrganismName : Streptococcus pneumoniae R6 
<4 00> PreSequenceString : 

atgaaatcaa taactaaaaa gattaaagca actcttgcag gagtagctgc 
gtatttgctc catcatttgt atctgctcaa gaatcatcaa cttacactgt 
gatacacttt cagaaatcgc tgaaactcac aacacaacag ttgaaaaatt 
aaccacattg ataacattca tttgatttat gttgatcaag agttggttat 
gtagcgcctg ttgcaacacc agcgccagct acttatgcgg caccagccgc 
actgtttcag ctccagtagc agaaactcca gtagtaagtg aaacagttgt 
agcggatctg aagcagaagc caaagaatgg atcgctcaaa aagaatcagg 
acagctacaa atggacgtta tatcggacgt tacggttcat ggactgctgc 
tggcttaaca atggctggta ttaa 
<212> Type : DNA 
<211> Length : 504 

SequenceName : SEQ ID 746 

SequenceDescription : 

Sequence 



cttgtttgca 
taaagaaggt 
ggcagaaaac 
cgatggccct 
tcaagatgaa 
ttcaactgta 
tggtagctat 
taaaaacttc 



<213> OrganismName : Streptococcus pneumoniae R6 
<4 00> PreSequenceString : 

atgaaacatt cacataaaaa atcatttgac tggtatagca tgcaacaacg ttattctatt 
cgtaagtatt actttggtgc agctagtgtc ttgctcggta ccgctttggt attaggtgca 
gcagctagtg tccaaacagt acaagcggaa gaaaacaaac aagaaactac caatagtatt 
tctgttggta ggggagaagc agctactaaa ccagcagagg tttctgcgtc taataaagag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
, 714 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
687 



60 
120 
180 
240 
300 
360 
420 
480 
504 



60 
120 
180 
240 
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aaaacctatg cagctccaac tgtagctaat 
gaagttacta aaccagcaga aaaagttgaa 
catcaagatg ccgttgacaa gtcaaaatta 
gaaagcaagt tatatacaga agcaagtgct 
5 caaagcttgc ttggaaaagc agatgcaact 
attcaatcat ttattattgg tctagaactt 
gaaacgcctg tagcgaagaa agctgatgca 
gcaacaactg aacgttcagc tgttgatagc 
gtagaaacaa cttcagctcc agcatctatt 
10 tctgatgctc gccaaaatcc agctatccgt 
tttagagcgg ctagcaatcc agccaaccca 
tttgcagata ttagccaagg tggtcgtagc 
ggtggaaatt ctattcatta cgatgtaaca 
acaattagtt attctacgcc aggagattct 
15 gataaagggg atggatttgg aaatccttca 
gtaagggagc aatcaaaatc tattagtcag 
agtatgacct ctgctafcttc aacaaatacg 
attaatctaa atggtgattt gtctgttcgfc 
ggtggtggtg gtgccgctac tagcaatgac 

20 gcaaacccac tatacttgga tgcaaaccct 
gaagacattg acttccaaac tgtctatctt 
agattagttc gagaaggtga aaaaggacaa 
ggtaacgaaa cacttttagg attgccgatt 
cgtattatgc aaattggtgt ggctaaagat 

25 caaaataaag tcggtgatac aaataacctc 
gtttatactg aaggtgtaga cgaacttgtt 
ggtgaaaaag gagaccaagg tgaacgcggt 
cgaggcgaac gcggtctaac tggagctcaa 
gaacgcggtc taactggagc tcaaggagct 

30 ggtctaaccg gagctcaagg agctaaaggt 
actggagctc aaggagctaa aggtgaaaaa 
gcacaaggag ctaaaggtga aaaaggggac 
ggtgccaaag gtgaaaaagg ggaccaaggt 
aaaggtgaaa aaggagccca aggtgaacgc 

35 gaaaaaggag accgaggcga acgcggtctg 
ggagaccgag gcgaacgcgg tctgactgga 
caaggtgaac gcggtctaac cggagcccaa 
gaacgcggtc taactggagc acaaggtgaa 
ggagctaaag gtgaaaaagg ggaccaaggc 

40 aaaggagccc aaggtgaacg cggtctaacc 
gaccaaggtg aacgcggtct cactggagcc 
ggtcgtgacg gtgtaactcc aaccgtaacc 
actatcacta ttaacgacgg tagaggtaat 
gatggcgcaa gtccattagt tgcgactcaa 

45 atcttctatt acgataaaaa tggaaacaat 
gaagttgtaa ttgcagatgg tgctaaaggt 
caagggcgtg atggtgaaca aggaccaaaa 
gtaactgatg gtcaagatgg aacgcataca 
actactacag tagtaagaga tggatttgat 

50 aatgaagcag ataaaacaac aactgttatt 
tttgatgaag gagatacaaa acttaaagaa 
aaaggtgaca aaggtgataa cggaaaagat 
aacaataacg gaacacacac catcacaatc 
acaatcgtta aaaacggtga agatggaaaa 

55 gcgaagaaac aaacaacatt aacattctac 
gcaggaaaag acgagttagt tcaaacaaca 
ggagcatctg gacgtgatgg aaaagaagta 
ggtaaagatg gagacacatt cgtaaataca 
aacacttggg aaccagcagg aaacatcaaa 

60 gccaagggtg aaaaaggagc ccaaggagaa 
ggtgaaaagg gagatcaggg tgaacgcggt 
caaggtgaac gcggtttaac tggagcgcaa 
cttcaaggtc gtgatggagc tcaaggacca 
ggaccacaag gaccaaaagg agaacaaggt 

65 tctctaattg ctgttaaaaa tggtgtatta 
caaacaacat ttgtagagga tggacaaaag 
ataactgagg ggcaaaacgg cacacataca 
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ccagtagaaa cgactccagt taaaactgaa 3 00 

gaagcaaaag acaaaaaaga ggaagtaacg 3 60 

ttaacggctc ^tttcgcgtgc taaaaaatta 420 

gctaacttgc aaacaagtat ccaagctggt 480 

gaagctgaat tatcagcagc agagtcatct 540 

cgttctaact ctaataaaga aactgtatca 60 0 

gttgaatcaa aagaaggggc taaaccagct 660 

gctattttgc caactagcac agctgacaaa 720 

aatgaaatct tgaaactagg tttgagcctt 780 

aaggaagatg ttaatagagg gtatagtggt 840 

attgtctcag gttctggaaa tacagttgca 900 

tatagtttcc gtggctacgg gaactcacgt 960 

acagtacgta gtggtaatag tgttaacttt 1020 
.agagagtttg ttaataataa ttttatcttg - 1080 

aatgcaacga tcacaagttc aaatccaaga 1140 

ggtgccaact acgtatctca ctcaggatac 12 00 

gaacaaacta tcagatttag tttgcctatc 1260 

ttgaaacctg ttacttttaa cgtggaccaa 1320 

ccatacagta actctaacta ttattataga 1380 

tatggtggta ctaataataa gactgtttca 1440 

ccaactagta agttaccaga aggtcaaact 150 0 

cgtcaaatta cctataaagt tcatcgattt 1560 

agtaatagtg ttactaaaga agctaagcca 162 0 

ctaatcgata cagtaaaacc acgtgttgat 1680 

actttctatc ttgataacga tggaaacggt 1740 

caaaaaattg ctattaaaga tggagctaaa 1800 

ctaactggag ctaaaggtga aaaaggagac 1860 

ggagctaaag gtgaaaaagg agaccgaggt 1920 

aaaggtgaaa aaggagaccg aggtgaacgc 1980 

gaaaaaggag accgaggcga acgcggtcta 2040 

ggagcccaag gtgaacgcgg tctaaccgga 2100 

caaggtgaac gcggtctaac cggagcacaa 2160 

gaacgcggtc taaccggagc . acaaggagct 2220 

ggtctaactg gaactcaagg agctaaaggt 2280 

actggagccc aaggagctaa aggtgaaaaa 2340 

gcccaaggtg ctaaaggtga aaaaggagcc 240 0 

ggtgctaaag gtgaaaaagg ggaccaaggc 2460 

aaaggagacc gaggcgaacg cggtttaact 252 0 

gaacgtggta tcactggagc taaaggtgaa 2580 

ggagcccaag gtgctaaagg tgaaaaaggg 2640 

caaggtgaaa aaggagccca aggtcaggca 2700 

gttaaagata ataaaaatga cggcactcat 2760 

gttacaagta ctgttgtaag agatggtttc 282 0 

cgaaatgacg cagataagac aacaactgtt 28 80 

gaattagatg cttctgataa gaaattaaaa 2940 

gaaaaagggg acaaaggtga acaaggtctt 30 00 

ggagaagatg ggaaaacccc aacagttaaa 3060 

attacaatta acgatggtaa aggtggtata 3120 

ggtgcaagcc ctcttgtttc tactcataga 3180 

ttctattatg atctaaatga taataatcaa 3240 

gttgttatcg cagatggaaa acaaggacca 33 00 

ggattcacac cagaagtaac agttacagat 33 60 

acacaaccag acaacagacc atcattaaca 3420 

acaccaaaag tcaaagcaga acgagatgat 348 0 

attgacaaag atggagatgg aagttacaca 3540 

gtagttaaag acggacaaga cggagctgca 3600 

ttaaacggaa aagtagaccc aacaacagaa 3660 

caaacaggag atgtattcgt taagaaaggt 3720 

ggaccgaaag gtgacaaagg tgcagatggt 3780 

cgcggcctga ctggagcgca aggtgtcaag 384 0 

ctaactggat ctaaaggtga aaaaggagac 3900 

ggtgccaaag gtgacaaagg tgaacaaggt 3960 

aaaggagcag atggacaaag aggaccagcg 402 0 

aatccaggaa ctccaggtaa agatggaaaa 408 0 

gtaacaatta caccagtaga aggtcgtcca 414 0 

ggtgctgatg ggaaaactcc aacagtaaca 420 0 

ttaacggttc ataatccagg aagtccagat 4260 
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gtgacaacta cgatccgtga tggagctaca ggacaagcag gtcgtgatgg taaagatgta 4320 

ttaaacggaa aagtaaatcc acaaccaaac caaggtaaaa atggagataa atatattaat 43 8 0 

atcgaaaccg gtgatgtcta tgttaaaaac aatggaaact gggataaaga aggcaacatc 4440 

a aaggcccta aaggtgacaa aggtgcagat ggtgctaaag gcgaaaaagg agaccaaggc 450 0 

5 gaacgcggcc taactggagc gcaaggagct aaaggtgcgg atggcgcagt aggtcgtgat 4560 

ggacgtgacg gtaaagacgt gttgaacggc aaagctaacc cagaagcaca tcaaggtaaa 462 0 

gacggcgata aatacgttaa tacagaaaca ggcgacgtct tcgttaagaa taacggcaac 4680 

tgcjgataaag agggcaacat caaaggccct aaaggtgaca aaggtgcaga tggtgctaaa 474 0 

ggcgaaaaag gagaccgagg cgaacgcggc ctgactggag cgcaaggagc taaaggtgcg 4800 

10 gafcggagcag caggtcgtga cggacgtgat ggacgtgacg gtaaagacgt gttgaacggc 48 60 

aaagttaacc cagaagcaaa tcaaggtaaa gacggcgata aatacgttaa tacagaaaca 492 0 

ggcgacgtct tcgttaagaa taacggcaac tgggataaag agggcaacat caaaggctct 498 0 

aag-ggtgaca aaggtgaacg cggagaagat ggtaagactc cagaagtaac tgtaactcca 5040 

ggtaaagatg gccatagtac tgacattaca ttcactgttc caggtaaaga tccagttaca , .5100 

15 gtaiaatgtta aggacggaga aaatggtctg aacggtaaaa ctccaaaagt tgatttactt 5160 

cgtgtccaag gaaaaaacgg aaatccatct catacaattg tgacattcta tacagatgaa 5220 

aacaatgacg gcaaatatac accaggaact gatgaacttc taggttcaga aatgattaaa 52 8 0 

gatggtgcta aaggcgcgga cggacgagat ggtaaatcat tgcttactgt caaggatggt 5340 

aaagaaacta aagtttacca agaagatcca gctaacccag gacaaccatt aaatccagaa 5400 

20 aaaccacttg cggtaattag agatggagta gatggaaaat cacctacagt tacagctgtt 5460 

cgtaaagatg aagcagggca taaaggtgta gaaatcactg ttgataacca tgatggttca 5520 

caaccaacta cagtctttgt tcaagatggt gctaagggaa aaactggtgc aaccggtcag 558 0 

gatggacaaa ctcctacaat cactactcaa cgtggacaag atggccaaag cactgttgta 5640 

actatcacaa catcaggtaa agatccagta accttcactg taaaagatgg taagaatggt 5700 

25 aaagatggcc gtgcaccgaa aatcaaagta gaagatatta cttcaccttc aagaattaga 5760 

cgcgatacag atgctgctgc aactccaacg cgtaacggta tccgtgttac agtttatgat 5820 

gafcgttaatg acaatggggt atacgacgaa ggtgtcgata aagtattaaa tagtaaagat 58 8 0 

atfc-fcataacg gtatagatgg acgtgatggt tcagctccaa ctattactac aaaagataat 594 0 

ggagatggaa ctcacactat cacagttcaa aacccagatg gttctgaatc aacaacagtt 60 00 

30 gtfcaaagatg gtaaagacgg taaaactgcg aatatcacta caacagaaaa cccagatgga 60 60 

agccacacaa ttacggtgac aaatccagat ggttcaacta aagaaactgt tgttaaaaac 6120 

ggtaaagacg gtaagactcc taaagttgaa gtaacggata acaacgatgg aactcatact 6180 

gttaaagtta cagatggaga cggcaatgtt accaacgcta tcatcaaaga tggtaaagac 6240 

ggtiataagctg caacagcaac aactactgaa aatccagatg gaagccacac agtaacaatc 63 00 

35 actaacccag acggaactaa gaatgagttt gttgttaaga atggacgtga cggtgttgac 6360 

gga,cgtactc caaccgcatc tgttcgtgat aatggagacg gaagtcatac aatcgttatt 6420 

acaaatccag aaggtgtgac aactgaaacc acagttcgtg atggtaaatc accaaaagtg 6480 

actataactg atgaacaaaa tggaactcat aagatctctg ttctaaatgg tgacggaaca 6540 

actactgaaa caatcattaa agatggtaaa tcaccagtag caacagttag agataaccaa 660 0 

40 gatggtactt acactattcg tgtggaaaac ggtaatggta ctgtttctga aaccacagtt 6660 

cgtgacggta aatcaccaac tgctaaggtt gtggataatg gagatggaac tcacactatc 6720 

acagttgtga actcagacgg aataactaca acaactacag ttcgtgatgg tagagaacca 6780 

aaacttgaag ttattgataa caacgatggt tcacacacta ttaaagtgac aggtgctgat 6840 

ggtaaaggaa cgacaactac aatctttgat ggtaaatcac caaaagcgaa catcgttgat 690 0 

45 aacggagatg gaactcatac attaacaatc gtagattctg atggtcgtga atacaaatct 6960 

attatcaaag atggtaaaga cggcaaagat agcgtttcac caactgtaac tgttaaaaat 7020 

aataacgatg gaactcacgt tgttacaatc actaatccag atggaagtaa gacagaaatg 70 80 

gtgattaaag acggtaaaga tggtaaatca ccaaaagttt ctgttgaaga taatggtgat 7140 

ggtagtcata caatcacaat catcaattct gatggaactg tgacaaaaac agttattaaa 7200 

50 gatggcaaag atggtagaga tggacgtgat ggtcgagacg gcaaagacgg taaagatgga 72 60 

aaatgtggat gccaagacaa accagtaaca ccatcaaatg acaaaccagt tcctccaaca 732 0 

ccaaatgtgc cgacaccaga agtaccggtt aaacctgtgc cagcgcaacc aacaccaaat 7380 

gtaccgacac cagaagtgcc agtacaacca actccagctg tttcaacacc agaagtaccg 744 0 

gttaaaccag taccagcggt tccagaacaa ccagtagtac caacaccggc tcaaccagca 7500 

55 actccagtaa atgctaaccc agtagcacca actacaggta aagaaaaccg tggggacaaa 75 60 

ttacctgaaa ctggaagcca atctgattat atctctgttc ttttaggtag cggtattcta 7620 

ttgagcctat atgtaggacg aagaaaagaa gattaa 7656 
<212> Type : DNA 
<211> Length : 7656 
60 SequenceName : SEQ ID 747 
SeguenceDescription : 

Sequence 



65 



60 
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actcaagcag aagaagttct ttggactgca 

actaaaacgg acaacaaaac aagttatacc 
gcagaagcct tgggtgtaga tgtcacagtg 

gacttgattt tcccagaaac tgttttgaca 

5 gaagttgaaa tccaaacacc tcaagcagac 

gatttgacca ctaatcaagt gaccgttgat 

caaccaattg cagaagctcc aaaagaagta 

attgcttctg aagaagtggc accatctacg 

gaaacaagca gtgcagttgc agaagaagct 

10 gaaacacaaa caagccctca agctgcatca 

gcaaaagaag tagcatcatc aaatggagct 

gaaacgaaaa taatttcaac aacttacgag 

gcagtagcaa aatctgaaaa tgcaggtctt 

attgctsact tgtttggcat tacatccttt 

15 cacggaaaag gtttggctat cgattttatg 

attgcggaat atgctattca aaatatggcc 

caacgtttct atgctccatt cgatagcaaa 

ccagaccgtg gtagtgtgac agaaaatcac 
taa 

20 <212> Type : DNA 

<211> Length : 1143 

SequenceName : SEQ ID 748 
SequenceDescription : 



cgtagtgttg agcaaatcca aaacgatttg 12 0 

gtacagtatg gtgatacttt gagcaccatt 180 

cttgcgaatc tgaacaaaat cactaatatg 24 0 

acgactgtca atgaagcaga agaagtaaca 3 00 

tctagtgaag aagtgacaac tgcgacagca 360 

gatcaaactg ttcaggttgc agacctttct 420 

gcatcaagtt cagaagttac aaagacagtg 48 0 

ggcacttctg tcccagagga gcaaacggcc 540 

cctcaggaaa cgactccagc tgagaagcag 600 

gcagtggaag caactacaac aagttcagaa 660 

acagcagcag tttctactta tcaaccagaa 72 0 

gcaccagctg cgcccgatta tgctggactt 78 0 

caaccacaaa cagctgcctt taaagaagaa 840 
agtggttatc gtccaggaga cagtggagat . . ,.900 

gtaccagaac gttcagaatt aggggataag 960 

agccgtggca ttagttacat catctggaaa 1020 

tatgggccag ctaacacttg gaacccaatg 1080 

tatgatcacg ttcacgtttc aatgaatgga 1140 

1143 



25 Sequence 



<213> OrganismName : Streptococci 
<400> PreSequenceString : 
gtgacgattc taggaaaaga tacagttcaa 

30 gaagctacac cagagtataa gctagaaaat 
ggaagctcag atgctaatgc gaatgaaggc 
acaggttcac aaaactcagc tcaatcacaa 
tcagctaaaa atgccattga aaaagcagcc 
ccgctttctg ataaagaaaa agcagaactt 

35 gctctcaaag agattgaaaa tgcgaaaact 
ggagtgcaag ccattgccat ggttacagtt 
cctaagacaa caagtgcacc gcaagcaact 
tcacctgctg gcaaacaatt acctaacaca 
cttggtctag tggtggcaac aagtggtttt 

40 aaatag 

<212> Type : DNA 
<211> Length : 666 

SequenceName : SEQ ID 749 
SequenceDescription : 

45 

Sequence 



is pneumoniae R6 

caatctgcga aaggtgaatc tgtaactcaa 60 

acaccaggtg gagataaggg aggcaatact 120 

ggtggtagcc aggcgggtgg atcagctcac 180 

gcttctaagc aattagctac tgaaaaagaa 240 

aagaacaagc aggatgaaat caaaggcgca 300 

ttagcaagag tggaagcaga aaaacaagca 360 

atggaagatg tgaaggaagc agaaacgatt 420 

cctaagagac cagtggctcc taatgctgct 480 

gcaggaacaa tgcaagatgt tacctaccag 540 

ggttcagcat caagtgcagc acttgctagt 600 

gctttgctag gaagaaagac tagacgtaga 660 

666 



<213> OrganismName : Streptococcus pneumoniae R6 
<400> PreSequenceString : 

50 atgatgacga caggttgctc tatgggagcc tatcatgcac tcaatttctt cctccagcat 60 
ccagatgtct ttaccaaagt gattgctctc agtggtgttt acgacgcacg tttctttgtc 12 0 

ggtgattact acaacgatga tgctatttac caaaactcgc cagtagatta tatttggaac 180 
caaaacgacg gctggtttat tgaccgttac cgtcaggcag agattgtgct gtgtacgggg 240 
cttggagcct gggaacaaga tggtttgcca tccttttaca agctcaaaga agcctttgac 3 00 

55 aagaaacaaa ttccagcctg gtttgctgaa tggggacatg atgtcgccca tgactgggaa 3 60 

tggtggcgta aacaaatgcc ttatttcctc ggtaatctct atttataa 408 
<212> Type : DNA 
<211> Length : 40 8 

SequenceName : SEQ ID 750 

50 SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
65 <4 00> PreSequenceString : 

atgaataaag gattatttga aaaacgttgt aaatatagta ttcggaaatt ttcattaggt 60 
gttgcttctg ttatgattgg agctacattc tttgggacaa gtccggttct tgcagatagc 12 0 
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gtgcagtctg gttccacggc gaacttacca gctgatttag ctactgctct tgcaacagca 180 
aaagagaatg- atgggcatga ttttgaagcg cctaaggtgg gagaagacca aggttctcca 240 
gaagttacag atggacctaa gacagaagaa gaactattag cacttgaaaa agaaaaaccg 3 00 

gctgaagaaa. aaccaaaaga ggataaacct gcagctgcta aacctgaaac acctaagacg 3 60 

gtaacccctg aatggcaaac ggtagagaaa aaagaacaac agggaacagt cactatccga 42 0 

gaagaaaaag gtgtccgcta caaccaatta tcctcaactg ctcaaaatga taacgcaggt 480 
aaaccagccc tgtttgaaaa gaagggcttg accgttgatg ccaatggaaa tgcaactgtt 54 0 

gatttaacct tcaaagatga ttctgaaaag ggcaaatcac gctttggtgt cttcttgaaa 60 0 

tttaaagata. ccaagaataa tgtttttgtc ggttacgaca aggatggctg gttctgggag 660 
tataaatctc caacaactag cacttggtat agaggtagtc gtgttgctgc tcctgaaaca 720 
ggatcaacaa accgtctctc tatcactctc aagtcagacg gtcagctaaa tgccagcaat 780 
aacgatgtca atctctttga cacagtgact ctaccagctg cggtcaatga ccatcttaaa 840 
aatgagaaga. agattcttct caaggcgggc tcttatgacg atgagcgaac agttgttagc 90 0 

gttaaaacgg ataaccaaga gggggtaaaa acagaggata cccctgctga aaaagaaaca 36 r » 

ggtcctgaag ttgatgatag caaggtgact tatgacacga ttcagtctaa ggttctcaaa 102 0 

gcagtgattg accaagcctt ccctcgtgtc aaggaataca gcttgaatgg acatacfcttg 1080 

ccaggacagg ttcaacagtt caaccaagtc tttatcaata accaccgaat cacccctgaa 1140 

gtcacttata. agaaaatcaa tgagacaaca gcagagtact tgatgaagct tcgcgatgat 12 00 

gctcacttaa. tcaatgcgga aatgacagta cgcttgcaag ttgtggacaa tcaattgcac 1260 

tttgatgtgaa ccaagattgt caaccacaat caagtcactc caggtcaaaa gattgatgac 132 0 

gaaagaaaac tactttcttc tattagtttc ctcggcaatg ctttagtctc tgtttctagt 13 80 

gatcaaactg gtgctaagtt tgatggggca accatgtcaa acaatacgca tgtcagcgga 1440 

gatgatcata. tcgatgtaac caatccaatg aaagatctag ccaagggtta catgtatgga 1500 

tttgtttctsL cagataagct tgctgctggt gtttggagta actctcaaaa cagctatggt 1560 

ggtggttcga. atgactggac tcgtttgaca gcctataaag aaacagtcgg aaatgccaac 1620 

tatgtaggaa tccacagctc tgaatggcaa tgggaaaaag cttataaggg cattgttttc 1680 

ccagaataca cgaaggaact tccaagtgct aaggttgtta tcactgaaga tgccaatgca 1740 

gacaagaaag tcgattggca ggatggtgcc attgcttatc gtagcattat gaacaatcct 18 0 0 

caaggttgga. aaaaagttaa ggatatcaca gcttaccgta tcgcgatgaa ctttggttct 18 60 

caagcacaaa acccattect tatgaccttg gatggtatca agaaaatcaa tctccacaca 192 0 

gatggtcttg- ggcaaggtgt tctccttaaa ggatatggta gcgaaggcca tgactctggt 1980 

cacttgaact atgctgatat tggtaagcgt atcggtggtg tcgaagactt caagacccta 2040 

attgagaaggr ctaagaaata tggagctcat ctaggtatcc acgttaacgc ttcagaaact 2100 

tatcctgagt ctaaatactt caatgaaaaa attctccgta agaatccaga tggaagctat 2160 

agctatggtt ggaactggct agatcaaggt atcaacattg atgctgccta tgacctagct 2220 

catggtcgtt tggcacgttg ggaagatttg aagaaaaaac ttggtgacgg tctcgacttt 2280 

atcfcatgtgg acgtttgggg taatggtcaa tcaggtgata acggtgcctg ggctacccac 23 40 

gttcttgcta aagaaattaa caaacaaggc tggcgctttg cgatcgagtg gggccatggt 2400 

ggtgagtacg actctacctt ccatcactgg gcagctgact tgacctacgg tggctacacc 2460 

aataaaggta tcaacagtgc catcacccgc tttatacgta accaccaaaa agatgcttgg 2520 

gtaggggact acagaagtta tggtggtgca gccaactatc cactgctagg tggctacagc 258 0 

atgaaagact ttgaaggctg gcaaggaaga agtgactaca afcggctatgt aactaactta 2640 

tttgcccatg acgtcatgac caagtacttc caacacttca ctgtaagtaa atgggaaaat 2700 

ggtacaccggr tgactatgac cgataacggt agcacctata aatggactcc agaaatgcga 2760 

gtggaattgg tagatgctga caataataaa gtagttgtaa ctcgtaagtc aaatgatgtc 282 0 

aatagtccac aatatcgcga acgtacagta actctcaacg gacgtgtcat ccaagatggt 28 80 

tcagcttact fcgacfcccttg gaactgggat gcaaatggta agaaactttc tactgataag 2940 

gaaaagatgt actacttcaa tacgcaggcc ggtgcaacaa cttggaccct tccaagcgat 3 00 0 

tgggcaaaga gcaaggttta cctttacaag ctaactgacc aaggtaagac agaagagcaa 3060 

gaactaactg taaaagatgg taaaattacc ctagatcttc tagcaaatca accatacgtt 312 0 

ctctatcgtt cgaaacaaac caatcctgaa atgtcatgga gtgaaggcat gcacatctat 3180 

gaccaaggafc ttaacagtgg taccttgaaa cattggacca tttcaggcga tgcttctaag 3240 

gcagaaattg tcaagtctca aggggcaaac gatatgcttc gtattcaagg aaacaaagaa 33 0 0 

aaagttagtc tcactcagaa attaactggc ttgaaaccaa ataccaagta tgccgtttat 33 60 

gtcggtgtcg ataaccgtag taatgccaag gcgagcatca ctgtaaatac tggtgaaaaa 342 0 

gaagtgacta cttataccaa taagtctctc gccctcaact atgtaaaagc ctatgcccac 348 0 

aatacacgtc gtaacaatgc tacagttgac gatacaagtt acttccaaaa catgtacgcc 3540 

ttctttacaa ctggatcgga cgtatcaaat gttactctga cattgagtcg tgaagctggt 3 600 

gatgaagcaa cttactttga tgaaattcgt acctttgaaa acaattcaag catgtacgga 3 660 

gacaagcatg atacaggtaa aggcaccttc aagcaagact ttgaaaatgt tgctcagggt 372 0 

atcttcccat ttgtagtggg tggtgtcgaa ggtgtcgaag acaaccgcac tcacttgtct 3 780 

gaaaaacacg atccatatac acaacgtggt tggaatggta agaaagtcga tgatgttatc 3 840 

gaaggaaatt ggtcactcaa gacaaatgga ctagtgagcc gtcgtaactt ggtttaccaa 390 0 

actattccgc aaaacttccg ttttgaagca ggtaagacct accgtgtaac ctttgaatac 3960 

gaagcaggtt cagacaatac ctatgctttt gtagtcggta agggagaatt ccagtcaggt 402 0 

cgtcgtggta ctcaagcaag caacttggaa atgcatgaat tgccaaatac ttggacagat 4080 

tctaagaaag ccaagaaggc aaccttcctc gtgacaggtg cagaaacagg ggatacttgg 4140 



WO 2005/076010 



PCT/IN2005/000037 



334/341 



gtaggtatct actcaactgg aaatgcaagt aatactcgtg gtgattctgg tggaaatgcc 42 0 0 

aacttccgtg gttataacga cttcatgatg gataatcttc aaatcgaaga aattacccta 4260 

acaggtaaga tgttgacaga aaatgctctg aagaactact tgccaacggt tgccatgact 4320 

aactacacca aagagtctat ggatgctttg aaagaggcgg tctttaacct cagtcaggcc 43 8 0 

gatgatgata tcagtgtgga agaagcgcgt gcagagattg ccaagattga agccttgaag 444 0 

aatgctttgg ttcagaagaa aacggctttg gtagcagatg actttgcaag tcttacagct 450 0 

cctgctcagg ctcaagaagg tcttgcaaat gcctttgatg gaaacttatc tagtttatgg 4560 

catacatcat ggggcggagg agatgtaggc aagcctgcaa ccatggtctt gaaagaagca 462 0 

actgaaatca ctggacttcg ttatgttcca cgtggatcag gttcaaatgg taacttgcgt 4680 

gatgtgaaac ttgttgtgac agatgagtct ggcaaggagc atacctttac tgcaactgat 4740 

t^gccagata acaataagcc aaaagacatt gattttggta agacaattaa ggctaagaaa 48 0 0 

attgtcctta caggtactaa gacttacgga gatggtggcg ataaatacca atctgcagcg 48 60 

gaactcatct ttactcgtcc acaggtagca gaaacacctc ttgacttgtc aggcfcatgaa 492 0 

gcagctttgg ctaaggctca gaaattaaca gacaaagaca atcaagagga agtagctagc .,.498 0 

gttcaggcaa gcatgaaata tgcgacggat aaccatctct tgacggaaag aatggtggaa 5040 

tactttgcag attatctcaa ccaattaaaa gattctgcta cgaaaccaga tgctccaact 510 0 

gtagagaaac ctgagtttaa acttagctct gtagcttccg atcaaggtaa gacgccagat 5160 

tataagcaag aaatagctag accagaaaca cctgaacaaa tcttgccagc aacaggtgag 522 0 

agtcaatttg acacagccct cttcctagca agtgttagcc tagccctatc tgctctcttt 52 8 0 

gtagtaaaaa cgaagaaaga ctag 53 04 
<212> Type : DNA 
<211> Length : 53 04 

SequenceName : SEQ ID 751 

SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pneumoniae R6 
<4O0> PreSequenceString : 

atgaaattat ataataaatc agaattacgt tattctcgca ttttctttga taagagacca 60 

ccagcttttg cgtttattct cattatttca actgctatta tcttaagcgg tgcattggtt 120 

ggcgcagctt atatacccaa aaactatatt gtaaaagcga atggaaattc agtcataaca 18 0 

gg-tacagagt tcctatcagc aattagttca gggaaggtag ttactttaca taagtcagaa 240 

ggagatatgg taaatgctgg agatgtcatt atttcgttat caagtggaca agaaggttta 3 00 

caagcgagct ctttaaataa acaattggtg aagttacgtg caaaagaagc tatctttcaa 3 60 

aaatttgaac aatcattaaa tgagaaatac aaccgtatgt ctaattctgg tgaagaacag 420 

gaatattatg ggaaagttga atactaccta tctcagttaa attcggaaaa ttataataat 480 

gg-tacccagt attcaaagat tcaggatgaa tatacgaagt tgaataaaat aacagctgaa 540 

agaaatcagt tagatgccga cttgcaaact ctacaaaatg aattgattca acttcaacag 600 

caaggagact ccccttcctt atcagatacg acatcagctg atgataaagc taagttagaa 660 

actaagatat tagaaataac aacaaaaatc gaagcattaa aaacaaatat tacttctaag 720 

aatragcgaga ttgatagtca acaaagcaat attaaagata tgaaccgtac ctataatgat 780 

ccaacttctc aggcttataa tatttatgct caattagtta gtgagttagg tactgctcgt 840 

tcaaacaaca ataaaagtat tacagagctt gaggctaatc ttggagtggc aacaggtcaa 900 

gataaagctc atagtatatt agcgccaaat gaaggtactc tgcattattt ggtacctttg 960 

aaacaaggaa tgtctattca gcaggggcaa acgatagcag aagtttcagg gaaagaaaaa 1020 

ggttactatg tagaagcttt tgtacttgcg agtgatattt ctcgtgtctc aaaaggagca 10 80 

aaagttgatg ttgctattac tggtgtaaat agtcaaaaat atggaacact aaagggacaa 1140 

gtcagacaga ttgattcagg aacaatttcc caagaaacga aagaggggaa tattagcctc 12 00 

tataaagtca tgatagaatt agaaacctta actctaaaac atggaagtga gacagtcgtc 12 60 

ctccaaaagg atatgccagt tgaagtgcgg attgtctatg ataaagaaac ctatcttgat 13 20 

tggattttgg aaatgttaag tttcaagcaa taa 13 53 
<212> Type : DNA 
<211> Length : 1353 

SequenceName : SEQ ID 752 

SequenceDescription : 

Sequence 



<213> OrganismName : Neisseria meningitidis serogroup A strain Z2491 
<4O0> PreSequenceString : 

atgaataaag gtttacatcg cattatcttt agtaaaaagc acagcaccat ggttgcagta 60 

gccgaaactg ccaacagcca gggcaaaggt aaacaggcag gcagttcggt ttctgtttca 12 0 

ctcjaaaactt caggcgacct ttgcggcaaa ctcaaaacca cccttaaaac cttggtctgc 180 

tcfcttggttt ccctgagtat ggtattgcct gcccatgccc aaattaccac cgacaaatca 240 

gcacctaaaa accagcaggt cgttatcctt aaaaccaaca ctggtgcccc cttggtgaat 300 

atccaaactc cgaatggacg cggattgagc cacaaccgct atacgcagtt tgatgttgac 360 
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aacaaagggg cagtgttaaa caacgaccgt 
gcgcaattga ttttgaacga ggtacgcggt 
gtaggcggtc aaaaggccga cgtgattatt 
ggcggcttta aaaatgtcgg tcggggcatc 
. 5 gacggtgcac tgacaggatt tgatgtgcgt 
tggaatgata aaggcggagc cgactacacc 
gggaaattac agggtaaaaa cctggcggtt 
agcggcgaaa tcagtgcagg tacggcagcg 
gccgcactgg gcggtatgta cgccgacagc 
10 ggcgtcaaaa atgccggcac actcgaagcg 
cgcattgaaa acagcggccg catcgccacc 
tatctctcca tcgaaaccac cgaaaaagga 
cggatcgaga gcaaaggctt attggttatt 
ggagccgtgg tgcagaataa cgg.cagtcgc 
15 aatttggtga fctgagagtaa aactaatgtg 
gccggcggtc gtactacgat caatgatgct 
tccaccaaag gcgatactga attgggtgaa 
gtattatcta acggtagtat tggcagtgct 
attgaatcgg gcaaaccgct ttctttagaa 

20 aacaacggta acattaaagg cggaaagcag 
gccaaaacta ccaatctgaa tactcccggc 
aatttgaatg ttgataaaga tttgtctgcc 
gcccatatta ccggcaccag taaaaccctc 
ggcttgctga atgttaccaa taccaatctg 

25 gcagccaaag gcaatattca gcttcgcaat 
accaccgcat tgcagggcaa tatcgtttca 
catgtatcct tattggccaa cggtaatgcc 
aaggccgatg tcaatgcagg atcggttggt 
atcacttcat cttcaggaga tattacgttg 

30 ggaaaacaac gcaattcaat caacggaaaa 
gccgacttaa aaaaccttaa cgtccatgcc 
cgggcattga gcatagaaaa fcaccaagctg 
:caacacgagc gggtaacgct caaccaagta 
accggcagcc agatfctggca aaacgacaaa 

35 ggtgtattgg cactcaatgc gcgcfcattcc 
ggtgcaatca accttactgc cggtaccgcc 
accgtttcga ccaagacttt ggaagafcaat 
aatattgaag caggtagcgg cacattaacc 
accgacctga gcatcaaaac aggcggaaaa 

40 ggtgcgccta gtgctcaagt ttcctcattg 
ggagaaacag atttaagagg ttctaaaatt 
accaaaggca agttgaatat cgaagccgta 
caaaaagcgg ctgaactcaa ccaaaaatcc 
aaaaaaagct cgccfcaaaag caagctgatt 

45 gctttctata ttcaagccat caacaaggaa 
tacctgcaag ccaagctttc tgcacaaaat 
atcagcggtt ccgatattac cgcttccaaa 
ccaaaggcag cagattcaga ggcggctgct 
gaaattggca agcccaccta caagagtcac 

50 cgtttgaccg gacgtacggg ggtaagtatt 
attattatcg gtgcatccga aatcaaagct 
agtgatattg tactggaggc tggacaaaac 
aaaagcggca aaatcatcag aaaaaccaag 
ccagcccccg tcgagctgac cgccaacggt 

55 gctaatacca cccgcttcaa tgcccctgca 
ctgcaactgc tggcagaaga aggcatccac 
cgctttatcg gcatcaaggt aggtaagagc 
aaattgcctg tccgcgtcgt cgcccaaact 
ctcgaaggta ccgaattcaa aaccacgctg 

60 gaaaaagccc gtgtcgatgc gaaaattatc 
gaagaaaaat tagaaaccaa ctcaaccgta 
atcgaaacgc taaaactgcc cagcttcgaa 
ggcggctata tcgtcgacat tccgaaaggc 
aaacagcccg agtatgccta tctgaaacag 

65 caggtgcagc ttgcttacga cagatgggac 
gcggcgatta tcgcactggc cgttaccgtg 
ttgggattaa acggtgcggc cgccgccgca 



aacaataatc cgfcttctggt caaaggcagt 420 

acggctagca aactcaacgg catcgttacc 480 

gccaacccca acggcattac cgttaatggc 540 

ttaactatcg gtgcgcccca aatcggcaaa 60 0 

caaggcacat tgaccgtagg agcagcaggt 660 

ggggtacttg cfccgtgcagt tgctttgcag 72 0 

tctaccggtc ctcagaaagt agattacgcc 780 

ggtacgaaac cgactatfcgc ccttgatact 84 0 

atcacactga ttgccaatga aaaaggcgta 900 

gccaagcaat tgattgtgac ttcgtcaggc 960 

actgccgacg gcaccgaagc ttcaccgact 1020 

gcggcaggca catttatctc caatggtggt 1080 

gagacgggag aagatatcag cttgcgtaac 1140 
ccagctacca cggtattaaa tgctggtcat . .,1200 

aacaatgcca aaggctcggc taatctgtcg 1260 

actattcaag cgggcagttc cgtgtacagc 1320 

aatacccgta ttattgctga aaacgtaacc 13 8 0 

gctgtaattg aggctaaaga cactgcacac 144 0 

acctcgaccg ttgcctccaa catccgtttg 15 0 0 

cttgcttfcac tggcagacga taacattact 1560 

aatctgtatg ttcatacagg taaagatctg 1620 

gccagcatcc atttgaaatc ggataacgct 1680 

actgcctcaa aagacatggg tgtggaggca 1740 

cgtaccaact cgggtaatct gcacattcag 18 00 

accaagctga acgcagccaa ggctctcgaa 18 60 

gacggccttc atgctgtttc tgcagacggt 192 0 

gactttaccg gtcacaatac cctgacagcc 198 0 

aaaggccgtc tgaaagcaga caataccaat 204 0 

gttgccggca acggtattca gcttggtgac 2100 

cacatcagca tcaaaaacaa cggtggtaat 2160 

aaaagcgggg cattgaacat tcattccgac 2220 

gagtctaccc ataatacgca tcttaatgca 228 0 

gatgcctacg cacaccgtca tctaagcatt 2340 

ctgccttctg ccaacaagct ggtggctaac 240 0 

caaattgccg acaacaccac gctgagagcg 2460 

ctagtcaagc gcggcaacat caattggagt 2520 

gccgaattaa aaccatfcggc cggacggctg 2580 

atcgaacctg ccaaccgcat cagtgcgcat 2640 

ttgctgttgt ctgcaaaagg aggaaatgca 2700 

gaagcaaaag gcaatatccg tctggttaca 2760 

acagccggta aaaacttggt tgtcgccacc 282 0 

aacaactcat tcagcaatta ttttcctaca 2880 

aaagaattgg aacagcagat tgcgcagttg 2940 

ccaaccctgc aagaagaacg cgaccgtctc 3 0 00 

gttaaaggta aaaaacccaa aggcaaagaa 3060 

attgacttga tttccgcaca aggcatcgaa 3120 

aaactgaacc ttcacgccgc aggcgtattg 3180 

attctgattg acggcataac cgaccaatat 3240 

tacgacaaag ctgctctgaa caagccttca 33 00 

catgcagctg cggcactcga tgatgcacgt 3360 

ccctcaggca gcatagacat caaagcccat 342 0 

gatgcctata ccttcttaaa aaccaaaggt 348 0 

tttaccagca cccgcgacca cctgattatg 3540 

atcacgcttc aggcaggcgg caacatcgaa 360 0 

ggtaaagtta ccctggttgc gggtgaagag 3660 

aagcacgagt tggatgtcca aaaaagccgc 372 0 

aattacagta aaaacgaact gaacgaaacc 378 0 

gcagccaccc gttcaggctg ggataccgtg 3 840 

gccggtgccg acattcaggc aggtgtaggc 3900 

ctcaaaggca ttgtgaaccg tatccagtcg 3960 

tggcagaaac aggccggacg cggcagcact 402 0 

agccctactc cgcccaaatt gtccgcaccc 408 0 

aatctgaaaa ccgaaatcga aaagctgtcc 4140 

ctccaagtag cgaaaaacat caactggaat 42 0 0 

tacaaacagg agggcttaac cgaagcaggt 4260 

gtcacctcag gcgcaggaac cggagccgta 432 0 

accgatgcag cattcgcctc tttggccagc 43 80 
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caggcttccg tatcgttcat caacaacaaa ggcgatgtcg gcaaaaccct 
ggcagaagca gcacggtgaa aaatctggtg gttgccgccg ctaccgcagg 
aaaatcggcg cttcggcact gaacaatgtc agcgataagc agtggatcaa 
gtcaacctag ccaatgcggg cagtgccgca ctgattaata ccgccafccaa 
ctcaaagaca acttgggcga tgccgcactg ggtgcgatag tcagtaccgt 
gtagcgagca aaatcaaatt taatctcagc gaagactaca ttacccacaa 
gccatagcgg gctgtgcggc agcggcggcg aataagggta agtgtcagga 
ggtgcggctg tgggcgagat agtcggggag gctttgacaa acggcaaaaa 
ttgacagcta aagaacgcga acagattttg gcatacagca aactggttgc 
agcggtgtgg tcggcggcga tgtgaataca gcggcgaatg cggctaaagt 
aataacctat tatctcaaga agagtatgct cttagagaaa aattgatcaa 
gggaaaggcc tattatcttt agattggggc agcctgaccg aacaagaggc 
atctatttga ttgagaaaga tcgatattct aatcaattgc ttgaccgata 
ccaagtagtt taaataatca agaaaaaaat attcttgcat attttattaa 
ggaggtaaca cagcttgggc agcttcgata ctgaaaacgc cccagtcaat 
actattcctt ccaaagatat taataacacc ttatcgaaag cctatcaaac 
tatgattctt ttgattacaa atcagctgtt gccgcacaac ctgcacttta 
ggaccgcttg gcttcagtgt caaagcagct actgtggcag caggaggata 
cagggagcga aagcaatctc taatggagaa tatctgcatg gtacagttca 
ggcacattga tggttgcagg atctgtatct gcacaggctg caatatcggc 
cctgttaccc gttatctgag caatgacagt gctcctgctt taagacaagc 
gaaagccaga gaatccgcat gaaactgccg gaagagtatc gacaaatagg 
atagcaaaaa ttgatgttaa aggattaccg caaaggatgg aagcatttag 
aaaggggaac atggatttat ttcgttacct gaaacaaaaa tttttaaacc 
gataaatatc ataatattgc ctctcctcct agaggaacat taagaaatat 
tataaattac ttgaaactat agcacagcaa ctcggaaata atcgtaatgt 
attgatctat ttacagaatt aaaggcctgt caatcttgca gcaatgttat 
agaaatcgct atccaaatat tcaattaaat atttttacag gaaaatag 
<212> Type : DNA 
<211> Length : 6048 

SequenceName : SEQ XD 753 

SequenceDescription : 



gaaagagctg 

cgtagccgac 

caacctgacc 

cggcggcagc 

acacggagaa 

gattgcccat 

tggtgcgatc 

tcctgccact 

cggtacggta 

cgcgattgaa 

aaaagccaaa 

aaggcagttt 

tcaaaaaaat 

ccaaacctct. 

gggtaafcctc 

attgagtcgt 

cttattaaac 

taacattgga 

ggttgttaat 

caagcctgca 

tttaactgct 

gaatcttgcg 

ttctttccaa 

tatatctgtt 

agatggagaa 

atcaggtaga 

tttagagttt 



4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
, 5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6048 



Sequence 

<213> QrganismName : Neisseria meningitidis 
<400> PreSequenceString : 

atggacttaa tccaaacccc gaataagcaa tttgtcgacg 
actcccgtac ccgcatggtg gctgaaccag ttacaaggcg 
9*cggttggca ttgagcctaa caaagccgac catgcccaag 
fctggccgccg atgcttcgca ggttgccagt atcgatgctc 
ggctatgtga acgtcaacgc ctatcacgcc aatacaacag 
cjcggataaag ccgataaatc taccgctgat aacggctgta 
ggcacgcgct ggaagcgtgt gttttcaggg atgcttaacc 
gccagcaaaa acaatgcact atccactttg aatgccgctg 
gttgttgatt gcttgggttt gtcaattgat acgggtaata 
tacacaaacg gcaagtttgt gattaacggc aaaactgtcg 
agaagcggta tcggtcgatt catctccgga actggtgcag 
gaatggactg gcgcgggttt aatcgttatt ggcgaaggcg 
tgtgtttcct caatcgctat tggcgaccgt gcgcagggct 
aacatcgcca ttggggccga cagcctgatt aatgtgcagg 
cagtcacgca tggaaggcac gcgcaacatc ggtattggtg 
accagcggtt actctaatgt gtcaatcggg cgcaatgccg 
tcgtcaaata ttgcacttgg cgcaggcgcg atggctggta 
ggcgacattg aagttttttg gccgtcttcg acctcaagaa 
gtcttgcaaa catatcaggg ccgcgccgct caaaccgcaa 
aatacaaaaa aggccgaaaa agttaccgca atcggttctg 
cgaaaccgcg ccccaaatgg cggagatgtt gtctggacgg 
cjcccaatctg gaaaaaacat cacgcttaca tttcccaaca 
tattgggtgg gcatccgcct tacatcaggc acggcgcaaa 
ccggctcagg tcgtatcagt gaatggcaat acattaatca 
accgccaccg gcgcggccga actgaaatac gtttattctg 
aacgaagagt tgaccatcat cggcgcgaac gccatgaata 
tcaactatca tcggcgtaga tgccgcgttg ttgggagaca 
sfccggcgcat catctttacg aacaggtagt catatttcca 
gtaatccctt tggcaagtag tgagaaatgt gttgccattg 
aacgttcaag gcgacttttt gactgggaaa ataacaaact 



serogroup A strain £2491 



gcgaccgccg 
agttgtacag 
tcttatcggc 
tgcgtaaata 
tgggcggcgg 
ccgttattgt 
tgcatgattt 
aatctgccgc 
tctacccgca 
atgttcaata 
cagccaacct 
caatggagca 
tttctaaagt 
ccgctactga 
gtaatgcagg 
gacagggatt 
ctgctccagt 
caatcgcaat 
ttggtgccaa 
ccgcgatgga 
gaacggaagc 
ttcgcggtgc 
ccttacaaaa 
tccaaagctc 
taaattcaac 
aggcattgac 
attatcaaaa 
caactgctat 
gagatagtgc 
ccatcgccat 



cacgcccggt 
cattttaaac 
cattaaaacg 
cagcggcaca 
cgtgtttgtg 
ttctaccgac 
tggatatgtg 
gcttgacgta 
aaaaaacaaa 
ccagcctatc 
caaatcgaat 
gatggagaaa 
aagcagggac 
atggtacgac 
acgcggcatc 

gggtgaaggc 
cggttttagt 
cggcgaggct 
tgcggcgcga 
gaatcttgag 
aggtacctac 
gcaagcgact 
cgacgtcgta 
aaaagagctg 
cgctactaaa 
cgcaggatac 
aacaaccgca 

tgggtattgg 

gggctatcgg 
cggatatggc 



60 
120 
L80 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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10 



15 



20 



25 



30 



35 



40 



gcaagaataa acggcgataa cgaaatccaa 
ccaaccgcgg tgaacatccg ttctgacggc 
aacggtttag attttgtaat gaagctcaag 
tcctacgttg acgaattatt caaagacttg 
gaatggtggg cgaatccaat caaggacggc 
tttattgccc aggacattgc tgcgctggaa 
aaaacaaacg atacctacac cgtcgaatac 
attcaggaaa tggccgcaag aattgaaaca 
tga 

<212> Type : DNA 

<211> Length : 2283 

SeguenceName : SEQ ID 754 
SequenceDescription : 

Sequence 



atcggtacga 
cgcgacaaag 
ccgatgactg 
ccggcagatg 
agtcataaag 
gatgaatatg 
gaaacgttca 
ttagaaaccg 



cagggcaaac 
cagatgttaa 
gctactacga 
aacgagcgga 
aagatcggtt 
gtcgattgcc 
tccccgtttt 
aaatgaagga 



tttatatgct 
gccgttgacg 
ccgccgggat 
caaagtccgc 
gcggcattgg 
gatggtaaat 
gactaaagcc 
atcgaaaaaa 



<213> OrganismName : Streptococcus pyogenes 
<400> PreSequenceString : 

atgaaaaata tctcacgcaa atgctttatg accagtgtgg 
attcttctag gggcfcggcta tgcaactgga ggacttcagg 
cccaaaaagg tcatcaaaac atttgaccaa ataactgctc 
tcaactatta cagtagagac aggacccgtt caaagaccaa 
cctaaattta ttgaccctat cgtcacaaca ttaacaggta 
aaacccaaag acattgtcat cactggtgga attgaaattt 
agtcgtcaag agaagaacta ccgttctatt accattactg 
aatgaagtta agggaagtaa tgtcccacat accactttgt 
atgcaattcg atggcaatct tactctttta cataccaaag 
ggtatgttgg aagccactaa aagtcagtta acaaatctcg 
ttttcaaacc tgactgattc tagtgtggaa aatgggacca 
ctaactacta aagataccac tctaaaagcc atcaatattc 
atagaagccg agagaacaac ccttgaaaat gtgaccttca 
gaagaagagg agaacgacta ctafcgacaat gatgctatct 
cttaaaggga ctaatactat tagtggtggt gatattgatg 
■gcaaaggcca tcgcctacag ggcaaggacc gaaaatggta 
ctgacaccag ctaagattgg taaggaatca acttcagatg 
aataaagcag ctactggaaa tfctaacggtt aatctcaata 
tga 

<212> Type : DNA 
<211> Length : 1083 

SeguenceName : SEQ ID 755 

SequenceDe scrip ti on : 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2283 



strain MGAS8232 




tatgtattat 


tctaggaggc 


60 


acattaaaca 


ccaaacagct 


120 


ttgatattga 


cagttctgcc 


180 


cagtgaccta 


ttacacacat 


240 


agaccctatc 


tctttcgcaa 


300 


taggttttac 


tctcaataat 


360 


ttcctgaaaa 


aactagcctc 


420 


caaatctaac 


tgtccaagat 


480 


tcaagaaagc 


tactatcact 


540 


agttaaaagc 


tgactattct 


600 


tcagcttagg 


aaatggacaa 


660 


aatcattaca 


ccctggcggt 


720 


ctgtttctaa 


aagcaaagaa 


780 


tcaccgctca 


tgcccttacc 


840 


ttgacataac 


cttgacaaaa 


900 


aagtctccct 


tggctcacag 


960 


ttatttctta 


tgtggctgag 


1020 


agggagacat 


tactatcaaa 


1080 






1083 



Sequence 

45 <213> OrganismName : Streptococcus pyogenes 
<400> PreSequenceString : 

atgtttaaga aagaaaattt aaaacaacgt tattttaatt 
ctaacaatat tagccatcat ttttgccttc tcaagtaaaa 
gctaagaagt cagaaagtaa aatggtaaca atcgacaagg 

50 attactaaag aagaaagcaa agaaaaagca aagagcattg 
gtagaaaact ctgtagctcc gacagtaaca gaggaagcac 
actcaaactg ttcagcaggt atcttcagta gcctataatc 
aatggaaata ctgctggtat tgtaggaagt caagcggcgg 
ggtgttccac aatcaacttg ggaacatata attgcgcgtg 

55 gcagctaatg cttctggggc atcagggttg ttccagacaa 
gcaacggttg aagatcaagt caatgcagcc ttgaaagcct 
gcttggggtt actaa 
<212> Type : DNA 
<211> Length : 615 

60 SeguenceName : SEQ ID 756 

SequenceDescription : 



strain MGAS8232 



ttggattagt 
atgctgatac 
ctccaaaaaa 
cttcggagcc 
cggttgttca 
caaataatgt 
cacagatggc 
aatctaatgg 
tgccaggttg 
atagtgcaca 



agcgttagct 
taagtcttat 
taatcatgct 
tattcccaca 
gcaagaagtg 
ggtactttcc 
agcagcaaca 
aaatcctaac 
gggttctaca 
aggtttatca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
615 



Sequence 

65 <213> OrganismName : Streptococcus pyogenes strain MGAS8232 
<400> PreSequenceString : 

atgttagaag aattgaaaac acttattaaa aatccaaaat taatgattac aatgattggt 



60 
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gtggccctag 
ggtcgggtca 
gataagtcat 
gactatcatt 
5 atggtcatta 
ccccaaaaac 

atgggggaaa 

tatacctcag 
actggtagtc 

10 tcagataact 
ttaacttcag 
cagctctcaa 
tctcaactta 
gctaaaagaa 

15 ctaaatgaaa 
aacttggctg 
aaggaacaac 
caaggggaat 
caaacgattt 

20 gatcagtcaa 
aatrcaagctt 
gtaaatggta 
gcacaattaa 
gccaatgcct 

25 ggtgctacta 
agcggattga 
tctcagcagt 
cttgtgttaa 
tatatgattg 

30 tctttatctg 
aatggtttta 
tttgaagccc 
atggctcttg 
•>*gttatgttat 

35 gcatttttcc 
caaaccattt 
ctggcattta 



tgcctgcctt 
atgaccttcc 
tgacaattgg 
ttgtatcttc 
ctttacccga 
taactatccg 
cagcgatggc 
cagtttttag 
aagcattgga 
tagcaggttt 
gattaacggc 
ccgatatgcc 
atcagggtct 
tccaatcttt 
atctctcaac 
ccattgctca 
tagcagtctt 
taactgctgc 
taaggtctgt 
aacagttgga 
tgccgggagc 
gcttaaatca 
acaggtataa 
tgtcatccaa 
aactagctga 
ctgaattatc 
tatcgcttgt 
atgagaaaga 
ctgtttctct 
gtcgtccggt 
tttcgactat 
gttatggtat 
tcacagcttt 
tgcttcaggt 
aaaagttaca 
cattatcagg 
tggtattatc 



atataattta 
cattgctgtt 
gaatgatatg 
aaaaagtgct 
agatctttct 
ttaccaaact 
taagctgaaa 
cagtatgaca 
ttcaggagcg 
atctagtgct 
ctatacagct 
agtttacctg 
ttcacagttg 
ggaggtagga 
gatgcaggta 
agctgcgcaa 
acaagccact 
tcttacccaa 
tcaaactttg 
gcaacttaag 
aagttctgct 
acaagttcta 
tactgccatt 
gtccggagaa 
tggtagttct 
aacaggattg 
ttctgtgact 
caaagatggt 
aatggttgtg 
caaagataaa 
gggatccatt 
ggaaacctta 
ggtcggttgg 
tggctcttca 
tcctttctta 
tcatattgga 
actactcatt 



tcctttctag 
gttaatcatg 
gtggacaaga 
caaaaaggtc 
cagcgagcaa 
agtaaaggac 
gagtctgttt 
gacctccaat 
aagacagccc 
agttggcaat 
ggtgttagtc 
aatggggttt 
acacaatcaa 
ttaccagttt 
ccaaaactta 
cagttacttg 
agcgcttatc 
actgataagg 
tcaacaagtt 
gaagctgttg 
ttaactgaat 
ccaggaagta 
ggttctgggg 
ttactagatg 
caattgagtc 
tcaatcttaa 
gataaaaatg 
gttaagacga 
gccctttcaa 
tgggattggg 
gttctctact 
ggatttatta 
gatgatcgat 
ggtggctctt 
ccaatgactt 
gtagaagtga 
tatcgtccca 



gctcaatgtg 
ataagccfcgc 
tgtctaaaag 
ttaaaaaagg 
caaccttatt 
atggaatggt 
cgcaaaacat 
caggattaaa 
aaatgggtag 
ttcaacaagg 
aagtaaagga 
ctcggttatc 
caacactttc 
taaatcaagg 
ataccgatga 
ttaaagaagc 
agtcactaac 
gtgaagctgt 
tacagtctct 
cacagattgc 
tatcaacggg 
atcaattgac 
taataaaact 
gtagccatca 
agggtggtca 
atggttcctt 
ctaaagctgt 
atgggatcgg 
ccaacgtcat 
ctaaacaaaa 
tagctattca 
tgctaagtgg 
atggctcttt 
accccattga 
atgtggtatc 
aagtcttaac 
agaaaacagt 



ggatccttat 
aaagagagct 
taaagattta 
tgattattat 
aaatcccgaa 
cgctgctaag 
tacgaagact 
agaagcctca 
tcaaatgctc 
aactaatcgt 
cggactaggg 
acaaggagct 
tgatgataaa 
tattcagcaa 
gttagggaat 
tgctgctcac 
tgcfcgagcaa 
ggctcctgct 
ctctcaagaa 
taatcaatcc 
attagcaaag 
aacaggatta 
ctcagaaggt 
attatcagaa 
tcaattaacg 
agccaaagcc 
cgcaaaacct 
gatggcacct 
ttttgctaat 
atttgttatt 
attattaggg 
ttggacgttt 
tgcttctttg 
gttaagtgga 
tggtttacga 
tggtttctta 
ctaa 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2274 



40 



45 



50 



55 



60 



65 



<212> Type : DNA 

<211> Length, - 2274 

SequenceName : SEQ ID 757 
SequenceDescription : 

Sequence 



<213> OrganismName : Streptococcus pyogenes 
<400> PreSequenceString : 

atgagcagag acccaacata tacaataaac gagcacgact 
ttttatgtga ccfcttaaggc agataagtca agtgagactg 
cttggcaata ccataatcaa aaagctacag gtcgaggatg 
gtaaagccta aagttaccac tcaacaagct tttggactag 
gatttacagc taaaagaccc taagtcagat ttgtggggca 
gcaatgctag tcgagtacgc caacaaagag atgtcaagtg 
cagatattgt tacaagtcaa gtctattgat gatgaacgat 
ctgaatggta tcaaacaaac tgtcaaaagt gagtcagttg 
gcatcaatgt ttgatagtcg tattagtgga cttgatggca 
acaattgata gtcttagcag tcgtcttgat gatggtgttg 
caaaaggtaa gtggcattga tttacgagtt agtaatgcag 
tctcagacag cacaaggatt gcagtcacaa atcacaaatg 
ttgtctcaga ctgtacaggg actacaaaca actgtacgtg 
agtcggatta atcagttaag tgacctgatc agcacaaagg 
acaactattg ctcagagtta cgacaagata gccttcgcaa 
agcaaaatgt ctggcagtga gattatctcg gcaatcaatc 
atcactggaa aaaatatcac tcttgatggt aacagctaca 
gatgctcata ttgctaacat ggatgccggt aagattaata 
aggattgcaa ccgaggccat tactggtgag aaaattaaga 
aaactcactg ctaacgaggg atattttagg acgttgtttg 



strain MGAS823 2 



tatcttttgc 
tgagacttaa 
acaatacaat 
ctcagcaggt 
aaatcaagtt 
ccattgcgca 
attccaaatt 
aatccgcacg 
aatacagtcg 
gtaactactc 
ctaatgatgt 
caaaccaaaa 
ataatcaatc 
tatcaaaagg 
tcagggataa 
ttgataggtc 
tcagcaacgc 
ctggttatct 
tggactatgc 
ccaaagacat 



agatggtcgt 


60 


cagtagttgc 


120 


gcacgacttt 


180 


caaagagctt 


240 


caataataag 


300 


atcagctgag 


360 


tgagcaaact 


420 


tactcagcta 


480 


tttgagccaa 


540 


aacgctatct 


600 


ttctcgat tg 


660 


ttacagcagt 


720 


aaatgctaca 


780 


tgatgttgag 


840 


actcccagca 


900 


tggggttaaa 


960 


tgtcatcaaa 


1020 


taatgctaat 


1080 


cttttttaat 


1140 


ctttgcaaca 


1200 
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gaggtgtatt 
cctttaatcg 
atggcacaca 
cagcgttgta 
ccggtcgttt 
cagtcgacca 
atcgaggatt 
atgcggctgt 
ctgctcatag 
tttaa 



strain MGAS8232 



tcagtccaat ctgtaacact atcagctagc aaaattactg 
aatggggcaa gtcagtggga cctaaataat gccaatatga 
attaatttta atagcaaaaa caatgcctta gtacgtaaag 
gtacatttta gtaatgccac accaaaaggc tatagaggct 
5 ggcattacct catcaggaga tggcatcgac agcgcttctt 
aggtcattta ggtacgctac gggatataac catactgctg 
tacggtgata atgtcttgat tgcagatgac tttagcatca 
ccagacaaaa tggaaaaagt gctcgacatg aacgacttgt 
ggccgctgtt gggggcactt ggctaacgtc ggctggaata 
10 agtgctgtga gtagggaatt gaataactac atcactaaaa 
<212> Type : DNA 
<211> Length : 1785 

SequenceName : SEQ ID 758 

SequenceDe script ion - , , . 

15 

Sequence 

<213> OrganismName : Streptococcus pyogenes 
<400> PreSequenceString : 

20 atggcagctg atggtaaggt aacgatactt gttgacgttg atggtaagca 
ctcaatagtg agttagataa agttgccaag cacggtgaca aaggcagctc 
aaatttgcgg ttggtgcagg agtctttaaa ttagcttcgg ctgcagttga 
caatctcttg gcaaggctat cacaagattt gacacgcttg aaaaatatcc 
aaagctatgg ggcatagcgc tgaggatgtt gctagatcaa ctgataagtt 

25 attgatggac taccaacaac tttagacgag gttgtcggaa ccgctcaacg 
attactaagg atatcaataa atcaactaat ctcacactag cattaaataa 
gcttcaggag cttcatcaga ggctgcaagc cgagggctgg agcaatatgc 
tcagctggta aggttgatat gcaagcttgg aaaaccctcc aagaaacaat 
ttacaacaaa ctgcggaagc ttttggattt gcaggggcat cggctcaaaa 

30 gaggcgttaa aaaacgggca aataacattt gaccaatttt ctaataagtt 
aatgatggtg tcggcggttt tgcagaacta gccaaagaaa atagtaaagg 
tcttttaaca acatcaagaa cgctattgca aaaggtgtgg ccaatagcat 
gatgatttgt ctaaggctgc aacaggtaag ggcatagctg atcattttga 
•gttgttatca atgcctcttt tagcgccatc aatgcaagta ttaaagctag 

35 tttaaacttt tgtttagtgt tattggtgct ggaatatcag tcgtcaaagc 
gccttagttg gtgtagcatc tggtctagct gccatgaggg cagttaatga 
atgattaaag cgctaaatag agcttgggtt atggcatctg catcaatgag 
acaaccatta agactgtgac tgcggtacaa gcggtaagta ccacgatgac 
atggttgcaa gactatctca gttaggtgtc ttaaaagcca gtaccgtgat 

40 atgacaggcg ctatcagttt atctactgct gcaaccatag ccagtactgc 
gcgttaaaag cagcacttgt agccttaaca ggtcccgttg gttgggtagt 
ggtgctttag ttgctgtcgg agtaagctta tggtcatggc taactaaaga 
accaaaaagc tgaaaaaaga gcaggagggg ctagtcgaaa gcaacaaaca 
tctgtccgtg agggcgtgca agagcgtaag aagggccttg agtccgtcaa 

45 gcagctcatc aaaaattagc tgacgaaatc attaagttag ccgccaaaga 
gcaggcgaaa aacaaaactt aaaaaataag attgatcagc ttaatgggtc 
ttaaacttgg cctatgacaa aaactccaat tctctttctc acaatgcaga 
tcacgcatta gtgccatgga agcagaaagc acatggcaaa cagcacaaca 
aatattgaac agaaacgtag cgaggttagc aaaaagctag ctgaaaatgc 

50 aaaaagtgga atgaagaagc taacgtctcc gattccgtcc gaaaagaaaa 
ctcacagaag aagaagctaa acttaaaaat atgcagactc aactgcagga 
aagacatcag ctactcaaca agctgctgca gacgctatgg ctgccgctga 
tcagcaagac aggttatagc gtacgaaaat atgtcagaag ctcaacgaac 
aatatgcgca ctaagtactc tgaactttta gagacaacga catctatttt 

55 gaacaaaaga cggcattatc agtagatcaa atgaatacca accttgaaaa 
gctactgaac agtgggctac taatttggag attttggctc agcgtggtgt 
attttggagc aactaagacg catgggtcct gaaggagcca cacagacgca 
gatgcaacag atgccgagct agcacccttg caggaaaact ttagagcagc 
gctaaaaatg caatggggag cgttttagac tcagcaggtg tggaaatgcc 
60 aaagggatgg tcactaatgt ttctacggga ttacaggcgg aactgcaagc 
gctcaacttg gccaagaaat ccctaatggg gtttctcaag gtataagtca 
aaagcaagtg acgcaagtgt caaaatgggt caagaagtta aacgctcttt 
ttgggtatcc actcgccatc gcgagtattt actgagtacg gtggccatat 
ttgagtaatg gtgtgacaaa tggaacgtca aaagttatgc aaaccatgca 
65 caacagatgt ctcaaaaagg acagcagatt gttaatgaca tgcgtagcaa 
atcacagatg cttttagcac gatgagtggt ccaatgcact ctcatggtgt 
caaggtttgg ccaatggtat ttatgcaggg tcgggggcag ctttagcggc 



agccgctaca 
agatgccaca 
tactgctttt 
tgcctctatc 
tgcagggcta 
aaccgagcta 
taaatttaga 
agtagcctta 
caattttaca 



ggtaaaggta 

ctctcttaaa 

tttggttagt 

aagggtcatg 

agcgaacgga 

tttgacctct 

tgccttttta 

ccaaatgcta 

gccttatgcc 

ggatttttat 

gattgagtta 

gattgaaacc 

taaggctttg 

tagtttgaaa 

tacaccgcta 

tctgtcgcct 

gactataaca' 

tattggagca 

taaagcagat 

ttatggtgtt 

tgcggtaact 

tggagctatc 

gtcagacgag 

gctaagagat 

agagagcact 

aaacaaaact 

tattgatggc 

tcaaattaag 

aaatctgtta 

tgatttgcgt 

gattgcagaa 

ggagtataac 

agaatcagga 

tgccatagac 

tgatgctata 

aaatagagct 

ggatcaaggt 

agtttttgtg 

cacagaaact 

agaaaaagtt 

tgctaacttt 

aggggcaggt 

tcaaggagag 

tactgatggc 

gagcttggct 

gtcgaaccaa 

taatgccatg 

agctcaaagc 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1785 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
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attgcggcac gtatcaccgc aacaattcaa agtgccttag atatccactc gccatctcgt 2940 
gttatgaggg atgaggttgg acgttttatc cctcagggta tcgctgtagg tattgatgcg 3 0 00 
gatagaaaag tcattgactc atctatgcaa aagctaaaag agtcaatgac gattaatgcg 3 060 
actccagaaa tagcctctgg atttggcgga ggagttgcgg ggattgctaa tcagaccaca 3120 
5 aataactcaa ataacagttt tacccttaat gtcaaggttg atgaatccga cggtaatagc 3180 
cacgagaaat atcaacgctt attcagagaa tttagctggt atattcaaca acaacaagga 3240 
aggttaggtg atgttaaatg a 3261 
<212> Type : DNA 
<211> Length ; 3261 
10 SequenceName : SEQ ID 759 

SequenceDescription : 

Sequence 

15 <213> OrganismName : Streptococcus pyogenes strain MGAS8232 t:. 
<4 00> PreSequenceString : 

atggctaaag aaccatggga agaaaaaatt gttgatgata ctatagggac acgaacacgt 60 
aaatcaagaa atgctttcat tagcacgcct tggttgactg ctttattaag tgtattcttt 12 0 

gtcatcattg ttgctatact ttttattttc ttctatacat caaatagcgg tagtaataga 18 0 

20 caagctgaaa caaatggttt ttatggagca tccactcata aaaaaacaag gaaagcttct 240 
aacgctaaaa aaacatcaag tagttcaaca actacagaca caacaccttc tagcgaagaa 3 00 

acacttgctt ctagtgaagg aaccggcgaa acccttactg tattggcagg tgagggggca 3 60 

gcttctattg cagctcgtgc aggtatttct gtggagcagt tacaagcact taatccagag 42 0 

cacatgactc aaggatattg gtatgccaat ccaggagatc aagtcactat taaataa 477 

25 

<212> Type : DNA 
<211> Length : 477 

SequenceName : SEQ ID 760 

SequenceDescription -. 

30 

Sequence 



. < <213> OrganismName r Streptococcus pyogenes strain MGAS8232 
, r ,<400> PreSequenceString t 

35 atgtctaaaa gaggaaaaat taaaataaca acgaaaacaa agcttattac agctagtgtt 60 

ataacgctag tattaattat aactggagta gtcttgtgga aacaacaaca aaatacgcta 120 

acagctgata tcgctaaaga accttactct actgttagtg taactgaagg gagtattgct 180 

tcttcgactt tactatcagg tactgtaaag gctttatcag aggaatatat ttattttgat 240 

gctaataaag gaaatgatgc aactgttaca gttaaaatag gtgatcaggt aacgcagggc 300 

40 cagcaattag ttcaatataa tacaacaaca gctcagtcag cttatgatac tgctgttagg 360 

agtcttaaca agattggccg acaaattaat catcttaaaa catacggagt tcctgctgtt 420 

agtacagaaa ctaataaaga tgaagctacc ggtgaagaga cgacaacaac agttcaacca 48 0 

tcagctcagc aaaatgctaa ttataaacag cagctgcaag atttaaatga tgcttatgca 540 

gatgcacaag cagaagtaaa taaagcgcag atagcgttaa atgatacagt agttatcagt 600 

45 agtgtctctg gaactgttgt ggaagtaaat aatgatattg atccttcttc aaagaacagt 660 

caaacacttg ttcacgtagc aaccgaagga cagcttcagg tgaaaggaac attgacagag 720 

tatgatttag caaacgttaa ggttggtcaa tctgtaaaaa ttaagtctaa agtttattct 780 

aatcaagaat ggactggaaa aatatcatat gtttcaaact atccaactga gtctaatgca 840 

ggttcaacaa cgccagcagg tagcactgga gcgggaagtt ctacaggagc tgcctatgat 900 

50 tacaagattg atattataag tcctcttaac cagcttaaac aaggtttcac tgtttctgtt 960 

gaggttgtta atgaagctaa acaggcctta gttcctttaa cggctgttat taagaaagat 102 0 

aaaaaacact atgtttggac ttatgatgat gctactggca aagccaaaaa agtagaggtg 10 80 

acacttggaa acgcagatgc acaacaacaa gaaattcata aaggagtagc tgttggtgac 1140 

attgttattg ccaatccaga taaaaatatc aaaccggata aaaaactaga aggggttatt 12 0 0 

55 tcaataggta ccaacacaaa accggaaaaa gattctcaat caaagaataa aaaatcaggg 1260 

gtggataaat ag 1272 
<212> Type : DNA 
<211> Length : 1272 

SequenceName : SEQ ID 761 

60 SequenceDescription : 

Sequence 



<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols 
65 <400> PreSequenceString : 

gtgctccgct tgccaactgc gcgcgcgtgc attacaatgg gcaccatgat cagacataca 60 
tttacgcata ggtgcggagc gctcctgtgc gcgctggcgt tgggaagctc cactatggct 120 
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gcgaccgccg ctgcaaaacc caagaaaggg caaatgcaga aattgcggca gcggccggtg 180 

tgggcgccca ccggcgggcg gtatgcgtct ttggacggtg cgtttaccgc gctggcaaat 240 

gatgcaagtt tctttgaggc aaatccggca ggaagtgcga acatgacgca cggggagctg 3 00 

gctttcttcc ataccactgg ctttggctcg tttcacgccg aaacgctctc ttacgtfcggc 3 60 

cagtcgggca actggggata cggcgcgtcg atgcgtatgt ttttccctga atctgggttt 42 0 

gacttttcta ccaccacgga gcccgtgtgc acacctgctt cgaaccccat taagcagcgc 480 

ggggcaattg gaatcatcaa ctttgcccgg cgtatcggag gtctctccct gggagccaac 540 

ctgaaggcgg ggttccgcga cgcgcagggc ctgcagcaca cctctgtctc cagtgacatc 600 

ggcttgcagt gggtggggaa cgttgccaag tcctttacct ctgaagagcc caacctgtac 660 

atcgggcttg cggccaccaa cttgggattg accgtaaagg tctcggacaa gatagagaac 720 

tgcacgagta cctgtgaaaa gtgtggttgc tgcaaggaga ggtgctgctg caacggcaag 78 0 

aaggcgtgct gcaaggactg cgactgtaac tgcccctgtc aggactgcaa cgacaaaggt 840 

acggtgcacg caacagacac catgctgcgt gcagggtttg cataccggcc cttcagctgg 9 0' 

ttcctcttta gccttggtgc caccaccagc atgaatgtgc agaccttggc tagtagtgac 96°£ 

gccaagtcgc tgtaccagaa cctggcttac agcataggcg ccatgtttga tcccttcagc 102 0 

ttcctgagct tgagttcgag ctfcccgcatc aaccacaagg ctaacatgcg agtgggagtg 108 0 

ggtgcagagg cgcgcattgc ccgcattaag ctgaacgcgg gataccgctg tgacgtcagc 1140 

gacatcagca gtgggagtgg gtgcacaggc gcgaaggctt cgcactacct ttccttgggt 1200 

ggcgcgatac tgctcggccg aaattaa 1227 
<212> Type : DNA 
<211> Length : 1227 

SequenceName : SEQ ID 762 

SequenceDescription : 

Sequence 



<213> OrganismName : Treponema pallidum subsp. pallidum str. Nichols 
<400> PreSequenceString : 

atgagcagaa cgttccgcgc gtggcagtgc gttggtgcgc tgtgtgcgct ctctcccctg 60 

ctgcctgcct acagctccga gggcgtgcga gaggtacccc cctcccagtc tccgcaggtg 120 

gtggtggcgt acgagcccat tcgccccggg gatcagctgc tcaaaattgg cattgttgca 180 

ggctgccagt tgtacatagc agggggaaat ggaaccaacg gctcttcgag ttccggcacc 240 

aacggtaacg gcaacggcaa actgctcggg ggcggggggt ttcacctcgg gtacgagtat 300 

ttttttacca aaaacttttc cctcggcggg caagtttcct ttgagtgtta ccgcacgacc 360 

gggtcaaact atfcacttttc tgttcccatc acggtaaacc ccacgtacac gtttgccgta 420 

gggcgctggc gcataccgct ctccctgggc gttgggctca acattcagtc ctatctcagc 480 

aagaaggcgc cggggcttafc tgcggaagcc agcgcggggc tctactacca gtacaccccg 540 

gactggtcca tcggcggcat tgttgcctac acgcagcttg gggacattgc aagctccccc 600 

gacaagtgca gagccgtggg ccttgccacc attgactttg gggtgcgcta tcacttttag 660 

<212> Type : DMA 
<211> Length : 660 

SequenceName : SEQ ID 763 

SequenceDescription : 
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